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Preface 



This volume marks the 20th anniversary of the New York Number Theory Sem- 
inar (NYNTS). The seminar began to meet in the Spring, 1982 semester at the 
CUNY Graduate Center in midtown Manhattan, and has been meeting contin- 
uously at the Graduate Center for two decades, even as the Graduate Center 
moved from its original location on 42nd Street near Fifth Avenue to tempo- 
rary quarters in an office building next to Grand Central Station to a new and 
elegant building in the former B. Altman department store on Fifth Avenue 
betwen 34th and 35th Streets. 

The seminar was originally organized by Harvey Cohn, David and Gregory 
Chudnovsky, and Melvyn B. Nathanson. In 1982, Harvey Cohn was at City 
College (CUNY) and the Graduate Center, the Chudnovskys were at Columbia, 
and Mel Nathanson was at Rutgers. Today, Harvey has retired to California, 
the Chudnovskys are at Poly techie University of New York, and Nathanson is 
at Lehman College (CUNY) and the Graduate Center. 

For 20 years the NYNTS has tried to present the broad spectrum of number 
theory and related fields of mathematics, from physics to geometry to computer 
science. Mathematics, like other sciences, is a mafia-run enterprise, where the 
local mafias represent currently fashionable fields of research, almost always 
important, but never nearly as important as the reigning dons believe. We have 
always tried to invite not only Fields metalists and other standard speakers, 
but also mathematicians, especially younger and relatively unknown researchers, 
whose theorems are significant and whose work might become the next big thing 
in number theory. We do not attempt to predict the future, but we consciously 
strive to include speakers who are out of the mainstream. 

Since its inception, the proceedings of the New York Number Theory Semi- 
nar have been published by Springer- Verlag, and we are grateful to Springer and 
its mathematics editors for their support of the Seminar. We thank Ina Linde- 
mann, the current mathematics editor of Springer, for her continuing interest 
in NYNTS. 

At various times in the past 20 years the seminar has been supported by 
grants from the NSA Mathematical Sciences Program, the Number Theory 
Foundation, and the Office of the Provost of Lehman College, and we appreciate 
their support. 
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The spanning number and the independence number of a subset of 

an abelian group 



Bela Bajnok 

Department of Mathematics, Gettysburg College 
Gettysburg, PA 17325-1486 USA 
E-mail: bbajnok@gettysburg.edu 

April 29, 2003 



Abstract 

Let A = {ai,a 2 , . . . ,amj be a subset of a finite abelian group G. We call A t -independent 
in G, if whenever 

AiOi T X 2 CL 2 + • • • + XmClm — 0 
for some integers Ai, A 2 , . . . , with 

|Ai I + IA2I + • • • + I Am I < 

we have Ai = A 2 = • • • = A^ = 0, and we say that A is s -spanning in G, if every element g oiG 
can be written as 

p = AiUi -H A202 + • • • -f- XmQ'm 
for some integers Ai , A 2 , • . . , A^ with 

lAi I + IA 2 I + • • • + I Am I < s. 

In this paper we give an upper bound for the size of a f-independent set and a lower bound 
for the size of an s-spanning set in G, and determine some cases when this extremal size occurs. 
We also discuss an interesting connection to spherical combinatorics. 



1 Introduction 

We illuminate our concepts by the following examples. 

Example 1 Consider the set A = {1, 4, 6, 9, 11} in the cyclic group G = Z 25 . We are interested 
in the degree to which this set is independent in G. We find, for example, that l-f-4-}-4 — 9 = 0 
and ll-fll-f9 — 6 = 0, but that such an equation with only three terms from A cannot be found. 
We therefore say that A is 3-independent in G and write ind(A) = 3. It can be shown that A is 
optimal in each of the following regards: 
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• no subset of G of size m > 5 is 3-independent in G (furthermore, A is essentially the unique 
3-independent set in G of size 5) ; 

• no subset of G of size 5 is ^-independent for t > 3 (that is, for t>3, there will always be t, 
not necessarily distinct, elements with a signed sum of 0); and 

• n = 25 is the smallest odd number for which a 3- independent set of size 5 in exists. (In fact, 

it can be shown that has a 3-independent set of size 5, if and only if, n = 20, 22, 24, 25, 26, 
or n > 28.) 



The fact that G = Z25 has this relatively large 3-independent subset is due, as explained later, 
to the fact that 25 has a prime divisor which is congruent to 5 mod 6. 



Example 2 How can one place a finite number of points on the d-dimensional sphere C 
with the highest momentum balance! For the circle the answer is given by the vertices of a 
regular polygon, but the issue is far more difficult for d > 1. For a positive integer n and a set of 
integers A = {01,02 , . . . ,Om}, define the set of n points X(A) = {xi^X 2 , . . . ,x„} with 



1 ( .27rioi. . ,2mai. ,2niarr 

I = —= • cos( ),sm( ), ... ,cos( 

\ n n n 



•),sin( 



27TZOyr 



( i = 1, 2 , . . . , n); thus, for example, for n = 25 and A = (1, 4, 6, 9, 11}, X{A) is a set of 25 points 
on the unit sphere 5^. It can be shown that this X(A) is a spherical 3-design, that is, for every 
polynomial / : — > R of total degree at most 3, the average value of / on equals the arithmetic 

average of / on X{A). We can also verify that X(A) is optimal in that 



• no set of 25 points is a t-design on for t > 3; 

• no set of 25 points is a 3-design on 5^ for d > 9; 

• n = 25 is the minimum odd size for which a 3-design on exists. (It was recently proved 
that an n-point 3-design on exists, if and only if, n = 20, 22, 24, or n > 25.) 



Example 3 Finally, consider A — {3,4} in G = Z25. Note that every element of G can be 
generated by a signed sum of at most three terms oi A: 1 = 4 - 3, 2 = 3 + 3 - 4, . . . , 24 = 3 - 4. We 
therefore call A = {3,4} a 3-spanning set in G = Z25, and write span(A) = 3. Again, our example 
is extremal; it can be shown that 



• no subset of G of size m < 2 is 3-spanning in G; 

• no subset of G of size 2 is s-spanning for s < 3; and 

• n = 25 is the largest number for which a 3-spanning set of size 2 in Zn exists. (Furthermore, 
as we will see, Zn has a 3-spanning set of size at most 2 for every n < 25.) 

In fact, this example ha.s an even more distinguished property: every element of G can be 
written uniquely as a signed sum of at most 3 elements of A\ we call such a set perfect. As a 
consequence of being a perfect 3-spanning set, A is also a maximum size 6-independent set in G. 
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The fact that G = Z25 has a perfect spanning subset of size 2 is due to the fact that 25 is the sum 
of two consecutive squares, as explained later. 

In the subsequent sections of this paper we define and investigate the afore-mentioned concepts 
and statements. Topics similar to spanning numbers (e.g. h-bases) and independence numbers (e.g. 
sum-free sets, Sidon sets, and Bh sequences) have been studied vigorously for a long time, see, for 
example, [9], [13], [15], [20], [22], [23], [27], and various sections of Guy’s book [14]. For general 
references on spherical designs, see [4], [8], [10], [11], [12], [17], [21], and [25]. 



2 Spanning numbers 

Let G be a finite abelian group of order |G| = n, written in additive notation. We are interested 
in the degree to which a given subset of G spans G. More precisely, we introduce the following 
definition. 



Definition 1 Let s be a non-negative integer and A — {oi, ^ 2 , • • • , ^m}- We say that A is an 
s-spanning set in G, if every g G G can be written as 

g = AiUi -I- A2U2 H- • • • -f XmQ'm 

for some integers Ai, A2, . . • , Am 'with 



|Ai| -f IA2I -f • • • -f |Am| < S. 

We call the smallest s for which A is s-spanning the spanning number of A in G, and denote it 
by span(A). 

Equivalently, >1 is an s-spanning subset of G if for every element g G G, we can find non- negative 
integers h and k and elements x and y in G, so that x is the sum of h (not necessarily distinct) 
elements of A, y is the sum of k (not necessarily distinct) elements of A, h-G k < and g = x — y. 

The case 5 = 0 is trivial: the only group G which has a 0-span is the one with a single element; 
therefore, we may assume that s > 1 and n >2. Obviously, A = G is an s-spanning subset of G for 
every 5 > 1. Here we are interested in small s-spanning sets in G; we denote the size of a minimum 
s-spanning set of G by p(G, s). 

For s = 1, it is clear that span(A) = 1, if and only if, for each p G G, A contains at least one 
of g or —g\ in particular, A must contain every element of order 2. Let 0(G, 2) denote the set of 
order 2 elements of G; with this notation we have 

p(0. .) . 10(0,2)1 + 

As a special case, for the cyclic group of order n we have 

p(Z„,l) = [n/2J. 



( 2 ) 
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For 5 > 2, values of p{G^ s) seem difficult to establish, even in the case of the cyclic groups. 
Computational data shows that 



, 2 )={ 



0 if n = 1; 

1 if n = 2, 3, 4, 5; 

2 ifn = 6,7,...,12,13; 

3 ifn = 14,15,...,21; 

4 if n = 22, 23, , 33, and n = 35; 

[5 if n = 34, n = 36, 37, . . . , 49, and n = 51; 



( 3 ) 



and 



0 if n = 1; 

1 if n = 2, 3, . . . , 6, 7; 
p[Zn,3)={ 2 ifn = 8,9...,24,25; 

3 if n = 26, 27, ... , 50, n = 52, and n = 55; 

( 4 ifn = 51,53,54, n = 56,57,...,100, andn = 104. 

(Values marked in bold-face will be discussed later.) 



( 4 ) 



As these values indicate, p(Zn, s) is, in general, not a monotone function of n, though we believe 



that 



P{s) := lim 






n—yoo 71 

exists for every s. The following theorem provides a lower bound for p{G^s) which is of the order 
as n goes to infinity. 

Theorem 2 Let m and s be positive integers, and define a{m, s) recursively by a(m, 0) = a(0, s) = 1 
and 

a(m, s) — a(m — 1, s) -f- a(m, s — 1) -f a{m — 1, s — 1). 

i. We have 



2. If G has order n and contains an s-spanning set of size m, then n < a(m, s). 
Proof. 1. Let us define 

a'(m,s) :=EQ Qz''. 

Jfc=0 ^ ^ / 

Clearly, a'{m, 0) = a'(0, s) = 1; below we prove that a'(m, s) also satisfies the recursion. 
We have 




5 



and 

a'{m — 1, 5 ) = 



g C: ;) (”; - 1 C ; ■) (\- y - (::;)- * (“: >■■ 



5—1 

= E 

A:=0 */ V •'' / k=0 

Next, we add a'{m — 1, s) and a'(m — 1,5 — 1). Note that 



=C:)^'' 

'tty (”; ■) (”■; > - g Cl ■) C"; 

n becomes 



and 



and by replacing A; by A; — 1, this sum becomes 

5-1 



Therefore, 







- E' 

fc =0 

= a'(m,s) — a'(m^s — 1). 






2. Assume that A = {ai, . . . , a„i} is an 5-spanning set in G of size m, and let 

S = {AiOi 4- • • * + \mO“m I Ai, . . . , Am € Z, | Ai | -b • • ’ -f |Am| < 5 }. 

We will count the elements in the index set 

I = {(Ai, • • • , Am) I Ai, . . . , Am € Z, |Ai| H + |Am| < 5}, 
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as follows. For k = 0, 1, 2, . . . ,m, let Ik be the set of those elements of I where exactly k of the 
m coordinates are non-zero. How many elements are in /;t? We can choose which k of the m 
coordinates are non-zero in (^) ways; w.l.o.g. let these coordinates be Ai, A2, . . . , A^. Next, we 
choose the values of |Ai|, IA2I, . . . , |A)fc|: since the sum of these k positive integers is at most s, we 
have (^) choices. Finally, each of these coordinates can be positive or negative, and therefore 



141 = 




2 ^, 



and 



\I\ 



s). 



Since A is s-spanning in G, we must have n = |E| < |/| = a(m, 5). □ 

Theorem 2 thus provides a lower bound for the size of an s-spanning set in G which is of the 
order as n goes to infinity. 

For exact values, we establish the following results. 



Proposition 3 Let s > I be an integer. 

1. jy 2 < n < 2s -h 1, then the set {1} is s-generating in Zn and p(1in, s) = 1. 

2. If 2s 2 < n < 2s^ -f 2s 1, then the set {s, s + 1} is s-generating in Zn and p(Z„, s) = 2. 

3. If n> 2s^ -f 2s -h 2, then p{Zn^ s) > 3. 



Proof. 1 is trivial. To prove 2, let 

S = {Ais -h A2 (s -f 1) I Ai, A2 G Z, |Ai| -h IA2I < s}. 

The elements of S lie in the interval [—(5^ -f- s), (s^ -}- s)] and, since the index set 
/ = {(Ai,A 2) I Ai,A2GZ,|Ai|-f |A2 |<s} 

contains exactly 2s^ 2s -f 1 elements, it sufiices to prove that no integer in [— (s^ -h s), (s^ -h s)] 
can be written as an element of E in two different ways. Indeed, it is an easy exercise to show that 

Ais -l- A2 (s -H 1) = AjS -f- A^(s -f- 1) G S 

implies Ai = A^ and A2 = A2; therefore, the set {s, s -h 1} is s-generating in Zn- As the s-span of a 
single element can contain at most 2s -|- 1 elements, for values n > 2s -h 2 we must have p(Zn, s) = 2. 
Statement 3 follows from Theorem 2 by noting that a(2, s) = 2s^ -f 2s 1. □ 

Let us now examine the extremal cases of Theorem 2. 



Definition 4 Suppose that A is an s-spanning set of size m in G and that a(m, s) is defined as in 
Theorem 2. If |G| = n = a(m, s), then we say that A is a perfect s-spanning set in G. 
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Cases where Zn has a perfect s-spanning set for s = 2 and 5 = 3 are marked with bold-face in 
(3) and (4). Trivially, the empty-set is a perfect s-spanning set in Zi for every s. With (2) and 
Proposition 3, we can exhibit some other perfect spanning sets in the cyclic group. 

Proposition 5 Let m, n, and s be positive integers, and let G ~ Zn- 

1. If n = 2m -f- 1, then the set {1,2,..., m} is a perfect 1 -spanning set in G. 

2. If n — 2s 1, then the set {1} is a perfect s-spanning set in G. 

3. If n = 2s^ -h 2s -I- 1, then the set (s, s -f 1} is a perfect s-spanning set in G. 

Note that the sets given in Proposition 5 are not unique: any element of the set in 1 can be 
replaced by its negative; in 2, the set {a} is perfect for every a which is relatively prime to n; it 
is not difficult to show that another example in 3 is provided by^ = {l,2sH-l} (however, the set 
(s, s -f 1} in Proposition 3 cannot be replaced by (1, 2s -f 1}). We could not find perfect spanning 
sets for s > 2 and m > 3. It might be an interesting problem to find and classify all perfect 
spanning sets. 



3 Independence numbers 

As in the previous section, we let (7 be a finite abelian group of order |G| = n, written in additive 
notation, and suppose that A is a subset of G. Here we are interested in the degree to which A is 
independent in G. More precisely, we introduce the following definition. 

Definition 6 Let t he a non-negative integer and A = (ai, 03 , . . . , am}- We say that A is a t- 
independent set in G, if whenever 

Aiai -h A2a2 -f- • • • -t- \mO>m — 0 

for some integers Ai, A 2 , . . . , Am with 

|Ai| -f IA 2 I + • • • + |Am| < ^5 

we have Ai = A 2 = • • • = Am = 0. We call the largest t for which A is t-independ-ent the indepen- 
dence number of A in G, and denote it by ind(A). 

Equivalently, A is a t- independent set in G, if for all non-negative integers h and k with h-\-k < t, 
the sum of h (not necessarily distinct) elements of A can only equal the sum of k (not necessarily 
distinct) elements of A in a trivial way, that is, h = k and the two sums contain the same terms in 
some order. 

Here we are interested in the size of a maximum f-independent set in G\ we denote this by 
q(G,t). 
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Since 0 < ind(A) < n - 1 holds for every subset A of G (so no subset is “completely” indepen- 
dent), we see that 0) = n and q{G,n) =0. It is also clear that ind(A) = 0, if and only if, 

0 G >1, hence 

q(GA)=-n-l. (5) 

For the rest of this section we assume that t >2. 

We can easily determine the value of q{G, 2) as well. First, note that A cannot contain any 
element of {0} UOrd(G, 2) (the elements of order at most 2); to get a maximum 2-independent set 
in G, take exactly one of each element or its negative in G\ Ord(G, 2) \ {0}, hence we have 

As a special case, for the cyclic group of order n we have 

g(Zn,2) = L(^-l)/2j. 

Note that if Ord(G, 2) U {0} = G then g(G, 2) = 0; for n > 2 this occurs only for the elementary 
abelian 2-group. If Ord(G, 2) U {0} ^ G then, since Ord(G, 2) U {0} is a subgroup of G, we have 

1 < |Ord(G, 2)1 -h 1 < n/2, and therefore we get the following. 

Proposition 7 If G is isomorphic to the elementary abelian 2-group, then q{G,2) = 0. Otherwise 

in < q(G, 2) < in. 

Let us now consider t — 3. As before, if G does not contain elements of order at least 4, then 
q(G, 3) = 0; this occurs if and only if G is isomorphic to the elementary abelian p-group for p = 2 
or p = 3. In [3] we proved the following. 

Theorem 8 ([3]) If G is isomorphic to the elementary abelian p-group for p = 2 or p — 3, then 
q(G, 3) = 0. Otherwise 

in < q(G, 3) < in. 

These bounds can be attained since g(Zg, 3) = 1 and g(Z 4 , 3) = 1. 

For the cyclic group Z„, we can find explicit 3-independent sets as follows. For every n, the 
odd integers which are less than n/3 form a 3-independent set; if n is even, we can go up to (but 
not including) n/2 as then the sum of two odd integers cannot equal n. We can do better in one 
special case when n is odd; namely, when n has a prime divisor p which is congruent to 5 mod 6, 
one can show that the set 

|jni + 2 i 2 + l|n=0,l,...,^-l, 12 = 0,1,...,^! (8) 

is 3-independent. It is stirprising that these examples cannot be improved, as we have the following 
exact values. 



( 6 ) 

(7) 
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Theorem 9 ([3]) For the cyclic group G — Zn we have 



^(Zn, 3) — < 




LtJ 



ifn is even, 

if n is odd, has prime divisors congruent to 5 
and p is the smallest such divisor, 
otherwise. 



(mod 6), 



For t > 4, exact results seem more difficult. With the help of a computer, we generated the 
following values. 



g(^n, 5) 



and 



^n,4)= ^ 



0 if n = 1, 2, 3, 4; 

1 ifn = 5,6,...,12; 

2 if n = 13, 14,..., 26; 

3 if n = 27, 28, ... , 45, and n = 47; 

4 if n = 46, n = 48, 49, . . . , 68, and n = 72, 73; 

5 if n = 69, 70, 71, and n = 74, 75, . . , , 102; 



( 9 ) 



0 if n = 1,2, 3,4,5; 

1 if n = 6, 7, . . . , 17, and n = 19, 20; 

2 if n = 18, n = 21, 22, . . . , 37, n = 39, 40, 41, n = 43, 44, 45, 47; 

3 if n = 38, 42, 46, n = 48, 49, ... , 69, n = 71, 72, 73, 75, 76, 77, 79, 81, 83, 85, 87; 

( 10 ) 



6 ) 



'0 if n = 1,2,3,. ..,6; 

1 if n = 7, 8, 9,..., 24; 

' 2 ifn = 25, 26, 27,..., 69; 

3 if n = 70, 71, ... , 151, and n = 153, 154, 155, 158, 159, 160. 



( 11 ) 



(Values marked in bold-face will be discussed later.) 

Again we see that 9(Z„, t) is not, in general, a monotone fimction of n; although for even values 
of t the sequence seems to possess more regularity and we conjecture that 



Q(t) := 



lim 

n— kCXD 



n 



exists for every even t. The following theorem establishes an upper bound for q(G, s) which is of 
the order as n goes to infinity. 



Theorem 10 Let m and t he positive integers, t > 2, and let us denote 




a{m, t/2) 

a(m, [t - l)/2) -f a(m - 1, (< - l)/2) 



ift is even, 
ift is odd. 



where a(m, t) is defined in Theorem 2. If G has order n and contains a t-independent set of size 
m, then n > q(m, t). 
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Proof. Assume that A = {ai, . . . , am} is a t- independent set in G of size m, and define 

E = {Aiai H f- \mO>m I Ai, . . . , Am € |Ai| H f- |Arn| < LV^J} 

and 

I = {(Al, • • • , Am) I Ai, . . . , Am G Z, |Ai| H 1- |Am| < U/2J}- 

As in the proof of Theorem 2, we have |/| = a(m, \ t/2\). Since A is a t-independent set in G, 
the elements listed in S must be all distinct, hence n > |S| = |/| = a(m, \t/2\). If t is even, we are 
done. 

Now let 

S' = {Aiai -\ 1- Am^m I Al, . . . , Am € Z, Al > 1, Al -f IA 2 I H -h 1 Am| = L^/2J + 1} 

and 

‘ ? ^m) I Al, . . . , Am e Al > 1, Al + IA 2 I H h |Am| = LV2j + !}■ 

We will count the elements in the index set |/'| as follows. For A; = 0, 1, 2, . . . , m — 1, let Ik be 
the set of those elements of /' where exactly k of the m — 1 coordinates A 2 , . . . , Am are non-zero. 
An argument similar to that in the proof of Theorem 2 shows that 

hence 

I''! - E (”•; ‘) (T)“* - “<”• - '• 

If t is odd, then the elements fisted in S' must be distinct from each other and from those in S as 
well, thus n > |S| -h |S'| = |/1 + |7'| = a(m, Lt/2J) + a(m - 1, [t/2j). □ 

Theorem 10 thus provides an upper bound for the size of a t-independent set in G which is of 
the order as n goes to infinity. 

For exact values, we establish the following results. 

Proposition 11 Let t >2 be an integer. 

1. If 1 < n <t, then q{Zn^ t) = 0. 

2. If t -\-l < n < [t^/2j -h tj then the set {1} is t-independent in Zn and q{Zn,t) = 1. 

3. (a) Suppose that t is even. If n > t^/2 + t H- 1, then the set {t/2,i/2 4- 1} is t-independent 

in Zn and g(Zn,t) > 2. 

(h) Suppose that t is odd. If n — — l)/2 + 1 H- 1, then the set {l,t} is t-independent in 

Zn and q{Zn^t) = 2. 
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Proof. Let q{m,t) be defined as in Theorem 10. Since q{lft) = t + 1, oin first claim follows 
from Theorem 10. To prove 2, note that if n > t-H 1, then {1} is t-independent in Zn; furthermore, 
q{2,t) = L^V2J + t + l. 

Now let t be even, and assume that n > t^/2 + 1 4- 1. We define 

S = {^i2 ^ ^ ^ ~ 

The elements of S lie in the interval [— (t^/2 + 1), (t^/2 + t)] and therefore, to prove 3 (a), it suffices 
to show that 

0 = Ai- -h A2(- + 1) € S 

implies Ai = A 2 = 0, which is an easy exercise. Statement 3 (b) is essentially similar. □ 

We now turn to the extremal cases of Theorem 10. 

Definition 12 Suppose that A is a t -independent set of size m in G and that q{m, t) is defined as 

in Theorem 10. If \G\ = n = q{m^t), then we say that A is a tight t-independent set in G. 

Cases where Zn has a tight t-independent set for t = 4, t = 5, and t = 6 are marked with 
bold-face in (9), (10), and (11). Trivially, the empty-set is a perfect t-independent set in Zi for 
every t. With (5), (7), Theorem 9, Proposition 11, and one other (sporadic) example, we have the 
following tight t- independent sets in the cyclic group. 

Proposition 13 Let m, n, and t be positive integers, and let G = Zn- 

1. If n = 2, then the set {1} is a tight 1-independent set in G. 

2. If n = 2m 4- 1, then the set {1,2, ... , m} is a tight 2-independent set in G. 

3. If n = 4m, then the set {1,3,..., 2m — 1} is a tight 3-independent set in G. 

4 . If n = t-\-l, then the set {1} is a tight t-independent set in G. 

5. Let n = \t^ /2\ 4- 1 + 1. If t is even, then the set (t/2,t/2 -f 1} is a tight t-independent set in 
G; if t is odd, then the set {l,t} is a tight t-independent set in G. 

6. If n = 38, then the set (1, 7, 11} is a tight b-independent set in G. 

Proposition 13 contains every tight (non-empty) t-independent set that we could find so far; in 
particular, we could not find tight t-independent sets for t > 4 and m > 3 other than the seemingly 
sporadic example listed last. The problem of finding and classifying all tight t-independent sets 
remains open. 

As it is clear from our exposition, there is a strong relationship between s-spanning sets and 
t-independent sets when t is even. Namely, we have the following. 
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Theorem 14 Let s and t positive integers, t even. Let A be a subset of G, and suppose that 
span(-4) = s and ind(i4) = t. 

1. The order n of G satisfies a{m,t/2) < n < a(m,s). 

2. We have t < 2s. 

3. The following three statements are equivalent. 

(i) t = 25 / 

(a) A is a perfect s -spanning set in G; and 
(Hi) A is a tight t-independent set in G. 

The analogous relationship when t is odd is considerably more complicated and will be the 
subject of future study. 

4 Spherical designs 

Here we discuss an application of the previous section to spherical combinatorics. We axe interested 
in placing a finite number of points on the d-dimensional sphere C with the highest 

momentum balance. The following definition was introduced by Delsarte, Goethals, and Seidel in 
1977 [8]. 



Definition 15 Let t be a non-negative integer. A finite set X of points on the d-sphere C 
is a spherical t-design, if for every polynomial f of total degree t or less, the average value of f 
over the whole sphere is equal to the arithmetic average of its values on X. 



In other words, X is a spherical t-design if the Chebyshev-type quadrature formula 

( 12 ) 

is exact for all polynomials f : of total degree at most t {erd denotes the surface measure 

on S^). 

The concept of f-designs on the sphere is analogous to t — {v,k,X) designs in combinatorics (see 
[24]), and has been studied in various contexts, including representation theory, spherical geometry, 
and approximation theory. For general references see [4], [8], [10], [11], [12], [17], [21], and [25]. 
The existence of spherical designs for every t and d and large enough n = |X| was first proved by 
Seymour and Zaslavsky in 1984 [26]. 

A central question in the field is to find all integer triples (t, d, n) for which a spherical t-design 
on exists consisting of n points, and to provide explicit constructions for these parameters. 
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Clearly, to achieve high momentum balance on the sphere, one needs to take a laxge number of 
points. Delsarte, Goethals, and Seidel [8] provide the tight lower bound 

- ‘-“V U/2J L(*-i)/2 ;■ 

We shall refer to the bound Nf in (13) as the DOS bound. Spherical designs of this minimum size 
are called tight. Bannai and Damerell [5], [6] proved that tight spherical designs for d > 2 exist 
only for t = 1, 2, 3, 4, 5, 7, or 11. All tight t-designs are known, except possibly for t = 4, 5, or 7. In 
particrdar, there is a unique 11— design (d=23 and n = 196560). 

Let us now attempt to construct spherical designs. One’s intuition that the vertices of a regular 
polygon provide spherical designs on the circle is indeed correct; more precisely, we have the 
following. 

Proposition 16 Let t and n be positive integers, 

1. Ifn<t, then there is no n-point spherical t-design on S^. 

2. Suppose that n > t -f- 1. For a positive integer j, define 

xi ;= (cos(^),sin(?^)) . (14) 

\ n ^ / 

Then the set Xn := {zn|i = 1, 2, . . . , n} is a t-design on S^. 

Proof. 1 follows from the DCS bound as Nf = t -f 1. To prove 2, we first note that, using 
spherical harmonics, one can prove (see [8]) that, in general, a finite set X is a spherical t-design, 
if and only if, for every integer 1 < A; < t and every homogeneous harmonic polynomial / of total 
degree A;, 

:x.£X 

(A polynomial is harmonic if it is in the kernel of the Laplace operator.) The set of homogeneous 
harmonic polynomials of total degree k on the circle, Harmfc(5^), is a 2-dimensional vector space 
over the reals and is spanned by the polynomials Re( 2 r^) and Im( 2 r*^) where z x y/-^y (we can 
think of the elements of X and as complex numbers) . Therefore, we see that X is a t-design on 
if and only if, 

E-''=o 

zex 

for A: = 1, 2, . . . , t. With Xn as defined above, one finds that 

n f 0 if A; ^ 0 mod n, 

= { 

j=i ^ n if A; = 0 mod n. 

Therefore, Xn is a t-design on if and only if. A; ^ 0 mod n for A; = 1,2, ... ,t (using the 
terminology of our last section, if and only if, {1} is a t-independent set in Zn), or n > t 4- 1- □ 
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A further classification of t-designs on the circle can be found in Hong’s paper [18]; he proved, 
for example, that if n > 2t + 3, then there are infinitely many t-designs on which do not come 
from regular polygons. 

We now attempt to generalize Proposition 16 for higher dimensions. For simplicity, we assume 
that d is odd, and let m = (d + l)/2. (The case when d is even can be reduced to this case by a 
simple technique, see [2] or [19].) 

Let ai, 02 , . . . , flm be integers, and set A ■:= {oi, 02 , . . . , a^}. For a positive integers n, define 
:= (z^(ai),z^(a 2 ),...,zJ(a„)) | j = 1, 2, . . . ,n| , (15) 

where, hke in (14), 

zi(oj) ;= (cos(— Oi),sin(— aj)) . 

\ n n J 

Note that Xn(A) C S^. In [2] we proved the following. 

Theorem 17 ([2]) Let t, d, and n be positive integers with t < ^, d odd, and set m = (dH- l)/2. 
For integers oi, 02 ? • • • ? o.m, define Xn{A) as in (15). If A is a t-independent set in Zn, then Xn(A) 
is a spherical t-desion on S^. 

Theorem 17 yields the following results. 

Corollary 18 Let n and d he positive integers, d odd, and set m = (d + l)/2. 

1. (a) If n — 1, then there is no n-point spherical 1-design on S^. 

(b) If n > 2, define ai = 1 for 1 < i < m. Then the set Xn{A), as defined in (15), is a 

spherical 1- design on S^. 

2. (a) If n < d-{- 1, then there is no n-point spherical 2-design on S^. 

(h) If n > d + 2, define ai = i for 1 < i < m. Then the set Xn(A), as defined in (15), is a 

spherical 2- design on S^. 

3. (a) If n <2d-\- 1, then there is no n-point spherical 3-design on S^. 

(b) If n > 2d -{-2 is even or if n > 3d -{■ 3 is odd, define Oj = 2i -f 1 for 1 < i < m; if 

n ^ ^ (3d + 3) 

p -f 1 

where p is a divisor of n which is congruent to 5 mod 6, choose A to be any m elements 
of the set in (8). In each case the set Xn(A), as defined in (15), is a spherical 3-design 
on S^. 
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Proof. Parts (a) are from the DGS bounds Nf for t < 3; parts (b) follow from Theorem 17 
since, by (5), (7), and the paragraph before Theorem 9, the sets specified are t-independent for 
t = 1, 2, and 3, respectively (note that in all cases of 2 and 3, m = (d -f l)/2 < q{Zn, t)). □ 

Part 3 of Corollary 18 leaves the question of existence of 3-designs open for some odd values of 
n. Note that the minimum value of 

^( 3 . + 3 ) 

is 5(d -f l)/2 (when n is divisible by 5). In [2] we proved that a spherical 3-design on (d odd) 
exists for every odd value of n > 5(d -h l)/2, and conjectured that 3-designs do not exist with 
2(d 4- 1) < n < 5(d -H l)/2 and n odd. This conjecture is supported by the numerical evidence of 
Hardin and Sloane [16]. A recent result of Boumova, Boyvalenkov, and Danev [7] proves that no 
3-design exists of odd size n with n <w 2.32(d 4-1). In particular, the case d = 9 of Example 2 
in our Introduction is completely settled: 3-designs on n points on exist, if and only if, n > 20 
even, or n > 25 odd. 

The application of t- independent sets to spherical t-designs seems more complicated when t > 4, 
and wiU be the subject of an upcoming paper. 

Acknowledgments. The author expresses his gratitude to his students Nicolae Laza for 
valuable computations and Nikolay Doskov for an improvement of Proposition 3. 
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A formula related to the Probenius problem in two 

dimensions ^ 

Matthias Beck and Sinai Robins ^ 



1 Introduction 



Given positive integers oi, . . . , a„ with gcd(oi, . . . , o„) = 1, we call an integer t 
representable if there exist nonnegative integers mi , . . . , m„ such that 



J = 1 

In this paper, we discuss the linear diophantine problem of Probenius: namely, 
find the largest integer which is not representable. We call this largest integer the 
Frobenius number p(ai, . . . , On)- We study a more general problem: namely, 
we consider N{t), the number of nonnegative integer solutions (mi, . . . ,mn) to 
t for any positive integral t. In this paper, we concentrate on two 
dimensions and obtain the formula 






Here 



{{x)) = X - [x] - 1/2 

is called a sawtooth function, a^^ai = 1 (mod 02 ), and 03^02 = 1 (mod ai). 
Prom this apparently new identity, we immediately recover and extend two well- 
known results: p(ai, a 2 ) = flia 2 — ai ~ 02 ([Sy]), and exactly half of the integers 
between 1 and (ai — l)(a 2 — 1) are representable ([Ni-Wi]). Note that we can 
rephrase the definition of ^( 01 , 02 ) to be the largest zero of N{t), 



More generally, we say that an integer t is A;-representable if N{t) = fc; that 
is, t can be represented in exactly k ways. Define Qk = ^ib(ai, . . . ,an) to be 
the largest A;-representable integer. It is fairly easy to see that for each fc, 
eventually all integers are ^-representable. Hence Qk is well-defined, and ev- 
ery integer greater than Qk is representable in at least A: -f 1 ways. In par- 
ticular yo(ai, • • • ,fln) = ^(<^ 1 , • • . jttn). We prove statements about gk simi- 
lar in spirit to the two classical results mentioned above. Finally, we define 
Sk = {t £ N : N{t) > k}. It is easy to see that Sk is a semigroup for all posi- 
tive integers k. Note that 5o D 5i D ^2 D . . . and hence provides an interesting 
filtration of the positive integers. This filtration forms an arithmetic progression 
in two dimensions but becomes nonlinear in dimension greater than two. 

^Preprint, December 13, 1999. Keywords', the linear diophantine problem of Probenius, 
rational polytopes, lattice points. 

^Dept. of Mathematics, Temple University, Philadelphia, PA 19122 
mat'thiasCma'th . 'temple . edu , srobinsChnath . temple . edu 
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2 Proof of the formula 



The Probenius setting can be translated into a lattice point problem: we look 
at integer dilates of the rational polytope 

f 

p = / (xi, . . . , Xn) G : Xj > 0, ^2, ^3^3 = 1 

I 

and ask for the largest such dilate which contains no lattice point. In the two- 
dimensional case, we want to study 

N(t) = # {(m,n) G : m,n > 0, am -h 6n = t} , 

which is the lattice point count on the hypothenuse of the triangle tT, where 

T = {{x,y) e : x,y > 0, ax -f < l} . 

We compute the quantity N{t) by partial fractions, inspired by their applica- 
tions to Dedekind sums in [Ge]. We note that one does not have to think of 
N(t) as the lattice point count of a polytope to understand the proof of the 
following theorem; however, this geometric interpretation was the motivation 
for our proof. 

Theorem 2.1 



Proof. N{t) is the constant coefficient of the generating function 



fiz) := 




(1 



We will expand this function into partial fractions: 



1 

z“) (1 - ■ 



/w = E 

A«=l 



Ax 
z — X 



+ E 

A*»=l 



Cl C2 ^ Dk 



(Alternatively, we could shift the constant coefficient to the residue and use the 
residue theorem.) Here, ' means we omit the trivial root of unity A = 1. Note 
that the nontrivial roots of unity yield simple poles, since a 2 tnd b are relatively 
prime. The reason for doing this is the following: the constant coefficient of the 
right-hand side, and thus iVt(p,g), equals 
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A«=l 



( 1 ) 



A*»=l 



The computation of the coefficients -4 a for a nontrivial a’th root of unity A is 
straightforward: 



A\ = lim 



z — A 



z-^x {\ — z^) {1 — z^) a(l — A^)A*”^ 

Similarly, we obtain for a nontrivial 6’th root of unity 

1 



JBa = - 



6(l~A“)A^-i 



The coefficients C 2 and C\ are simply the two leading coefficients of the Laurent 
series of /. Using the expansion 



it is easy to see that 
C2 = 



ab 



and 



Ci = 



-L -L - i. _ 

ab 2a 2b ab 



Hence we can rewrite (1) as 



We claim that 



iE'(rrW = -0-^- 

from which the statement of the theorem follows from (2) by 

1 ^ ^ ^ f f ^ L 

«A ^1 (1~A)A^“^^ VV a JJ 2a • 



(2) 

(3) 



It remains to prove (3), which is equivalent to the well-known finite Fourier 
series for the sawtooth function (see, e.g., [Ra-Gr]). For sake of completeness 
we give a short proof of (3), again based on lattice point considerations: consider 
the interval 

I := 1^0, = {x G R : x > 0, ax < ^} , 

viewed as a one-dimensional polytope. The lattice point count in X is clearly 



#(2:nz) 



+ 1 



a 



(4) 
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On the other hand, we can write this number, by applying the above ideas to 
the interval X, as the constant coefficient of 



(1- z)z^ ' 

If we expand this function into partial fractions in the exact same manner as 
above and compare constant coefficients, which equal the lattice point count, 
we obtain 

#(inz)=- + - + - + - X)' 

\“ = 1 ^ 



Comparing (4) with (5) yields (3). 



□ 



3 Consequences of the formula for N{t) 



Prom Theorem 2.1, we can derive two basic results on the Probenius problem: 
Pirst, Sylvester’s result ([Sy]) is a straightforward consequence: 

Corollary 3.1 g{a, b) = ab — a — b. 



Proof. We have to show that N{ab — a — 6) = 0 and that N{t) > 0 for every 
t > ab — a — b. Pirst, by the periodicity of the sawtooth function. 



N{ab -a-b) 




ab — a — b 
ab 



-(( 



b ^ (ab — a — 6) 



)) 






^(a6 — a — 6^^ 



For any integer m, ((^)) < ^ ~ Hence for any positive integer n, 



Ar/r r ab -a — b-\-n (\ 1\ fl 1\ n 

N(.ab-a-l, + n)> - -j - - -j = _ > 0. 



□ 
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Corollary 3.2 Exactly half of the integers between 1 and (a — 1)(6 — 1) are 
representable. 

Proof. We first claim that, if t 6 [1, oft — 1] is not a multiple of a or ft, 

N{t) + N{ab -0 = 1- (6) 



This identity follows directly from Theorem 2.1: 



mxm 



= 1 - 4 + 

ab 

= 1 - iv(0 



Here, (★) follows from the fact that ((— x)) = —((x)) if x 0 Z. This shows that, 
for t between 1 and oft — 1 and not divisible by a or ft, exactly one of t and ab — t 
is not representable. There are 



oft — a~ft-fl = (o — l)(ft — 1) = p(o, ft) -f 1 

integers between 1 and oft — 1 which are not divisible by a or ft. Finally, we note 
that N{t) > 0 if t is a multiple of a or ft, by the very definition of N{t). Hence 
the number of non-representable integers is |(o — l)(ft - 1). □ 

Note that we proved even more. By (6), every positive integer less than aft has at 
most one representation. Hence, the representable integers in the above corollary 
are uniquely representable. We now study integers that are A;-representable. 



Corollary 3.3 N{t + aft) = N(t) 4- 1. 



Proof. By the periodicity of the sawtooth function, 

t ab f fb~^{t-^ ab ) " 









(P)) -((=?))=»» 



+ 1 



□ 
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Corollary 3.4 5 *(o, b) = {k + l)ah — a — b. 



Proof. By the preceeding corollary, gk+i = 9k + ob. The statement follows now 
inductively. □ 



Corollary 3.5 Given k>2, the smallest k-representable integer is ab{k — 1). 



Proof. Let n be a noimegative integer. Then 
N(ab(k-1) -n) = 

If n = 0, (7) equals k. If n is positive, we use ((x)) > “*| to see that 
N{ab{k -l)-n)<k-^<k. 



□ 



All nonrepresentable positive integers lie, by definition, in the interval [l,^(a, 5)]. 
It is easy to see that the smallest interval containing all uniquely representable 
integers is [min(a,6),^i]. For A: > 2, the corresponding interval always has 
length 2a6 — a — 6 4* 1, and the precise interval is given next. 



Corollary 3.6 Given A: > 2, the smallest interval containing all k-representable 
integers is [pfc -2 + a + 5,pjb]. 



Proof. By Corollaries 3.4 and 3.5, the smallest integer in the interval is 

ab{k — 1) = 9 k -2 + a -h 6 . 

The upper bound of the interval follows by definition of gk- Q 



Corollary 3.7 There are exactly aft — 1 integers which are uniquely repre- 
sentable. Given A: > 2, there are exactly ab k-representable integers. 
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Proof. First, in the interval [l,o6], there axe, by Corollaries 3.2 and 3.5, 

(g-l)(6-l)_ 

2 

1-representable integers. Using Corollory 3.3, we see that there are 

2 

1-representable integers above ab. For k > 2, the statement follows by similar 
reasoning. □ 



4 Final remarks 



This paper gives only a glimpse of how the ’poly tope- view’ of the Probenius 
problem can be used to gain new results. In dimensions higher than 2, gener- 
alized Dedekind sums ([Ge]) appear in the formulas for N(t), which increases 
the complexity of the problem. The details will be described in a forthcoming 
paper ([Be-Di-Ro]). 

We conclude with a few remarks regarding extensions of the above corollaries 
to higher dimensions. Although no ’nice’ formula similar to Sylvester’s result 
(Corollary 3.1) is known in dimensions greater than 2, there has been a huge 
effort devoted to giving bounds and algorithms for the Probenius number. Sec- 
ondly, we remark that Corollary 3.2 does not extend in general; however, [Ni-Wi] 
gives necessary and sufficient conditions on the a^’s under which Corollary 3.2 
does extend. Corollary 3.3 extends easily to higher dimensions ([Be-Di-Ro]). 
We leave the reader with the following 

Problem Extend Corollaries 3.1, 3.4, 3.5, 3.6, and 3.1 to higher dimensions. 



References 

[Be-Di-Ro] M. Beck, R. Diaz, S. Robins, The Probenius problem, rational polytopes, and 
Fourier- Dedekind sums, in preparation. 

[Ge] I. Gessel, Generating functions and generalized Dedekind sums. Electronic J. Comb. 4 
(no. 2) (1997). 

[Ni-Wi] A. Nijenhuis, H. S. Wilf, Representations of integers by linear forms in nonnegative 
integers, J. Number Theory 4 (1972), 98-106. 

[Ra-Gr] H. Rademacher, E. Grosswald, Dedekind sums, Cams Mathematical Monographs, 
The Mathematical Association of America (1972). 

[Sy] J. J. Sylvester, Mathematical questions with their solutions. Educational Times 41 
(1884), 171-178. 




One Bit World 



David V. Chudnovsky^ and Gregory V. Chudnovsky^ 

^ IMAS, Polytechnic University, Brooklyn, 6 MetroTech Center, NY 11201 
david@imas.poly.edu 

^ IMAS, Polytechnic University, Brooklyn, 6 MetroTech Center, NY 11201 
gregoryOimas . poly . edu 



1 Introduction. 

We want to acknowledge Michael Gerzon of Oxford who had been an early 
pioneer of one bit audio. 

In the beginning there was a word. Analog audio was the first signal seri- 
ously studied and recorded. The first efforts in quantization of analog signals 
were directed towards audio and are known as PCM (Pulse Code Modulation). 

PCM is a technique patented in 1938 by Reeves. It still continues to be a 
major underpinning of all digital speech, audio and video after over 50 years 
of the first hardware implementation in 1947 (H.S. Black, Bell Laboratories). 
Shannon and others analyzed PCM in detail in 1948. It, essentially, consists 
of three steps of processing of analog signals: 
sampling, 
quantizing and 
binary encoding. 

The sampler converts a continuous time analog waveform x{t) into a dis- 
crete sequence of samples Xn = x{n/ fs) with a sample frequency /s, usually 
after the analog waveform is processed by a lowpass filter with the cutoff fre- 
quency /s/2 - Nyquist limit. The samples Xn are quantized and represented 
in a binary form as binary integers of a given dynamic range, which defines 
the number of quantization levels. 

These days the mathematically sophisticated PCM machinery is pushed 
under the rug and given a general name of A/D - analog to digital conversion. 
The dynamic range of A/D converters built by many manufacturers can be 
impressive, though most of the world is so far relatively happy with 16-bit 
(or even less than 16-bit) A/D convertors. This essentially means that analog 
waveforms are only sampled within 16 bit accuracy at uniform time intervals. 

The vastness of digital information encoded with the PCM quantization 
(most often without any compression whatsoever) is staggering. We think that 
the largest commercial repository of digital information is in the form of the 
high fidelity audio files. These days anybody can get them. Just download 
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your favorite ripper program from the Web, pop in your favorite music CD 
and in anywhere between a few minutes and a few hours (depending on how 
much you paid for that CD drive) you get over 500 Megabyte of PCM data - 
a stream of 16-bit integers adequately representing music (or voice) sampled 
at 44.1 KHz. 

On the top of PCM files - which are simple and uncompressed - one has a 
multitude of compression (or compression with encryption) formats. There is 
even a larger set of formats dealing with compression of PCM- information of 
video files. The video files can be thought of as two- dimensional PCM files; 
often represented as a linear PCM file, following the line scan of the video. 

Almost all stored digital non-trivial information is in the form of audio 
or video content. The music information is slightly redundant, but so far no 
lossless compression on music has achieved even 2 : 1 compression rate. 

The PCM represents the first step into the One Bit World 

Technically the PCM digitalization of the information can be considered 
as its conversion into a ’’one bit” stream if one looks at it’s radix 2 positional 
number representation. It is not what we have in mind, when we speak about 
one bit world. Here we literally mean sequences of 0 and 1 (in fact, we prefer 
to call them -f-1 and —1) representing individual bit- words. 

The actual reason for the need of one-bit data stream, accurately repre- 
senting an analog continuous signal, has to do with the hardware. The A to 
D conversion of high accuracy is a grand idea, but, in reality, the nonlinear 
effects and manufacturing difficulties make it very hard to achieve A to D con- 
verters of high dynamic range. On the other hand, a one-bit A/D convertor is 
a linear device (in a mathematical sense), and represents, basically, an easily 
realized comparator. Moreover, once a one bit A to D convertor is built, it is a 
matter of mathematics and not of circuit practice to turn, in digital domain, a 
one-bit stream, albeit of a much higher frequency, into a high dynamic range 
digital sample stream. 

This is the principle behind what is known as Delta-Sigma {A — U): differ- 
entiate the bandwidth-limited analog signal (Delta) into one-bit signal (and 
the remaining noise) stream; and then integrate, or, rather, filter (Sigma) the 
one-bit signal stream into a required digitalization of a high dynamic range. 

1.1 The A - U Solution. 

The first account of the Delta-Sigma belongs to C.C. Cutler in his U.S. patent 
of March 1960, [CU]. H. Inose and Y. Yoshida [lY] introduced the term ’’Delta 
- Sigma” and did the first analysis. In this country J.C. Candy in 1974 [CA74] 
introduced the name Sigma - Delta and popularized its use for A/D and D/A 
converters. 

Starting in 1987 in a series of papers by R.M. Gray and his coworkers 
the rigorous mathematical treatment of the Delta-Sigma appeared, [CA97], 
[GR87], [GR97], [GRCW], [GRND], [NST]. Not surprisingly, this is a number- 
theoretical treatment, relaying heavily on the theory of uniform distribution. 
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We give just the barest example of a relationship with diophantine approx- 
imations, following Gray [GR89]. We do not have to dwell on the modern 
development of uniform distribution as applied to the Delta-Sigma for differ- 
ent classes of functions. We do want to refer to interesting papers of Gunturk, 
some of which were presented at this seminar; cf. [GC]. 

Delta-Sigma modulation is defined by the following (nonlinear) difference 
equations in the time domain: 

_ r iz* if n = 0 

~ -e„_i forn= 1,2,... 

Here Cm (quantization noise) is defined in term of q{u). q{u) is the output 
from the one bit quantizer, defined in the simplest case as: 

, . _ f 6 if u > 0 
QW ^ otherwise 



and 



is the binary quantization noise. 

The intuitive idea of oversampled A/D is to shape the binary quantization 
noise with the filter, so that the quantization noise is mostly out of the base- 
band. A low pass digital filter is used as a decoding filter to produce a digital 
representation of the analog input. 

If the input Xn G [—b,b] and the linear decoding filter H has an impulse 
response {ho, hi, . . . , hn) then the Sigma- Delta A/D converter with decoding 
filter H band sampled at time N will satisfy 

N N N 

H{q{un)) = 'Y^hN-iq{ui) = hN-jXj-i + fiN-ijej - fi-i). 
i=0 2=0 2=0 

Consider a good noise shaping filter. E.g., a sinc^ filter is formed by cascade 
realization of two sine (comb) filters. Unlike the sine filter where each sample 
has equal weight, the weight of a sine^ filter is given by 

r ^ if/c = 2,3,...,iV + l 

^ I — ioT k = N -\-2,...,2N 

which has a triangular shape. 

The first example of the appearance of diophantine approximation (uni- 
form distribution theory) is the seemingly simple case of the DC signal (con- 
stant value): Xn = x. In the context of the simplest quantizer q{x) above, this 
leads to the problem of discrepancy of the sequence of {n-/3} for (3 = With 
the notation of the discrepancy function Dn{{(3}, . . , , {N (3}), determined by 
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the continued fraction expansion of P, we get Gray’s bound on the accuracy 
of DC-level reconstruction: 

The absolute decoding error with sinc^ decoding filter satisfies 

\x-Q{x)\<^Dr,{{/3},...,{Np}). 

Now we can characterize the importance of the Delta-Sigma in the acqui- 
sition of Petabytes of audio data. It is the Delta-Sigma that makes current 
CDs and other HiFi music possible. 

In the current CD mastering process an oversampled one-bit Delta - Sigma 
A/D is used as a front end, but the one-bit output signal must be decimated 
and re-quantized in real time to form the regular sample rate PCM signal. 

Recording a one-bit signal instead of a multi-bit recording format has the 
following advantages: 

1) simple system structure, because of the serial nature without any im- 
plied framing (nothing like the classical endian mode incompatibility); 

2) because the signal is oversampled, the system characteristics approach 
that of high quality analog audio; 

3) the bit stream for a known sampling rate is completely independent of 
the noise shaping filter, thus tapes recorded with different noise shapers can 
be interchanged, or; 

4) recorder performance can be optimized by changing the noise shaping 
filter and not the format; 

5) tape recorders are upward and downward compatible in terms of the 
format and only the quality of replay changes. 

These new formats are the underpinning of new audio files: SACD - super 
audio compact disc and DVD audio. We refer to Gerzon’s last paper for the 
original motivation [CRGE]. 

The Delta-Sigma relaxes demand on the analog circuitry at the expense 
of the increased demand placed on digital circuitry. It is the filtering DSP 
part - commonly known as noise shaping - that becomes paramount. Filters 
of length up to 2K are known to have been built into the Delta-Sigma. The 
quantization part of the Delta-Sigma is a nonlinear iterative mapping, and 
thus has wonderful properties of complex dynamical systems (limit cycles, 
various attractors, ... etc.). Moreover, to get a better noise shape Delta-Sigma 
is often cascaded to provide even more and more sophisticated dynamical 
systems, with even less chances of a complete analysis (cf. [NST]). 

One of the most difficult problems m A — E design is the choice of a high 
frequency - a high multiple of the Nyquist limit - at which to quantize the 
signal. For a sufficiently high multiple no spectral information is lost but at 
the expense of much more storage and bandwidth requirements. 

One of the advantages and simultaneously a problem with Delta-Sigma is 
the local character of its one bit quantization. When one can have a buffered 
stream of analog signals, one can make global decisions on one bit quantization 
in an asynchronous fashion, rather than providing one new bit every clock tick. 
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This global optimal one-bit quantization is very appropriate for the Delta- 
Sigma in D/A case (when the stream of data is known a priory). 

There is yet another hardware term describing this approach - Pulse Width 
Modulation (PWM). PWM is one- bit representation of an analog form as a 
train of 4-1 or —1 square pulses (often, PWM is even mistaken for Delta part 
in Delta-Sigma). 

The Optimal PWM (OPWM) approach provides the optimal number of 
one bit samples needed to reconstruct the original signal stream (in contrast 
with A — U). However, in general the PWM and OPWM approaches do not 
have any particular frequency at which the edges of the pulses are sampled - 
in theory the location of the edges is arbitrary. 

The OPWM problem for arbitrary harmonic series is solved in this pa- 
per using a combination of methods of orthogonal polynomials and nonlinear 
completely integrable systems. 



2 Optimal Pulse Width Modulation (OPWM). 

2.1 General Formulation. 

The most general Optimal PWM formulation is the following. Given a spectral 
bandwidth limited (analog) waveform, find a square form of amplitudes ±1 
that most closely approximates this waveform at the low frequency range. 
More precisely, given a periodic harmonic series 

N 

tp(t) = bo + ^(Ctn sinn^ 4- bn cos nt) 

n=l 

(with the fundamental domain [0, 27 t]), find the periodic pulse train f{t) 
with 2N-\-l (unknown) edges (z = 1, . . . , 2 A" -1-1) in the fundamental domain 
such that the Fourier expansion of f{t) has the leading 2 A 4- 1 coefficients the 
same as 

The main period part of the pulse train can be expressed as follows 



0 = Oq < Q;i < 02 < - . . < 027V < <^27V + l ^ <^27V+2 = ^TT. 

There are, of course, power and other isoperimetric constraints on the 
harmonic form for it to be accurately approximated by a square form 
f{t) (with the leading 2 A 4- 1 harmonic terms). The simplest of them is a 
consequence of the Parceval identity (and expresses a fact that the amplitudes 
of individual waves have to be limited in order to be represented by one bit 
pulses): 



N 

^^0 + + 6 ^) < 2 . 

n=l 
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In general, we ask for 



max \'ip(t)\ < 1. 

tG[ 0 , 27 r] - 

If a good pulse approximation f{t) to can be found, this f{t) can be 
used as a ’’digital” (or, rather, a one- bit) representation of It can be 
quantized in the t - domain by rounding the corresponding values of ai to a 
fixed frequency. It can be further compressed using additional properties of 
the distribution of Jumping a little bit ahead such asymptotic properties 
of ai do exist, because are related to the zeroes of orthogonal polynomials. 
This representation is also excellent for storage and transmission, especially 
for audio signals. 

Of course, in the real world, i.e., in the analog world, the square waveform 
does not look at all as a harmonic one. Thus to make a square form look 
(and, most importantly, sound) good, it has to pass through a low pass filter to 
restore its spectral bandwidth limitation. In audio playback this is the famous 
problem of psycho-acoustic spectral matching (according to an experimentally 
observed graph of spectral weights). For instrumentation providing the best 
quality of reproduction, the one bit nature of the encoded waveform is crucial, 
because one bit A/D and D/A converters provide the best signal quality. 

2.2 The Classical Pulse Width Modulation (PWM) Problem. 

The classical PWM - Pulse Width Modulation had been developed in the 
60s and 70s as a one-bit representation of waveforms with a very interesting 
feature: only square forms of amplitudes +1 or —1 are allowed, with positive 
and negative edges at arbitrary sample times. This is in a sharp contrast with 
PCM/DPCM and Delta-Sigma (Sigma-Delta), where the sampling happens 
at uniformly spaced times. Such an asynchronicity /non-uniform sampling is 
an important tool in any attempt to fight the Nyquist limit on the number of 
sample points vs. spectral support of the signal. 

Initial interest in PWM had been almost entirely driven by special require- 
ments of power electronics. There the only important signal to be represented 
is the primary (lowest) sine harmonics. Typical applications of PWM are 
in power electronics: digitally controlled electric motors, high voltage power 
transmission and control of quality of electrical power grids. 

In simple mathematical terms the basic PWM problem is that of approx- 
imating a sin wave of a given amplitude A: 

Asin^ on [0, 27 t] 

by a train of square pulses of +1 or —I amplitude (independent of A) such 
that spectral characteristic of the periodic pulse train (square waveform) is 
the same as that of a sin wave A sin t in the low frequency range. 

Because of the quarter period symmetry, it is enough to consider a square 
form only on [0,7t/2]. If one has N square forms to define on [0,7t/2] that 




31 



’’spectrally approximating” sine wave, one has N free parameters (edges of the 
pulses) o Patel in 1970 had been the first to derive an explicit set of equations 
on N pulse edges to eliminate N harmonics in the Fourier expansion of the 
full periodic pulse train (with 4N pulses per full period), see [PH]. It should 
be noted that in view of the quarter period symmetry, only sin(nt) terms are 
left with n odd. The resulting equations have the following form. 

For the quarter-period symmetric (periodic) pulse train f{t) with edges 
0 = ao < ui < . . . < < a^v+i = tt/2 (and /(qq) = -1, /(<^a^+i) = +1). 

we have the following harmonic expansion 

CX3 

f{t) = Vn- sin(ni), 

n=l,n— odd 



where 



A 

K = — • (-1 + 2 • V (-!)'+' cos(naO). 

nn ^ 

2=1 

Thus for the approximation of the basic waveform 

ai sin t 

to within + 1 first harmonics by the pulse train, the following set of N 
transcendental equations on unknown ai has to be satisfied 

N 

cos(nai) = Cn : n = 1, 3, . . . , 2N — 1 

2=1 



where 



^ / (f“i + 1/2) if n = 1 

” \ 1/2 if n > 1 

In general, for arbitrary quater-period symmetric harmonic polynomial we 
get the ’’system of Classical PWM equations” (in the transcendental form) by 
writing [3i = ai for odd i and pi = n — ai for even i: 

N 

cos{npi) = Cn : n = 1,3, ... ,2N - 1. 

2=1 

The case of an approximation of a single basic waveform (main PWM case) 
leads to the following system of transcendental equations: 

n n 

y^ cos ai = a, cos (2m - l)a^ = 0, m = 2, . . . ,n 
2=1 2=1 
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3 Orthogonal Polynomials, Solitons and Classical PWM 
Problem 

The classical PWM problem can be reduced (see below) to an algebraic 
problem of ’’odd sums of powers” where one has to determine the set {xi} 
{i = 1 .. .n) from the equations on ’’odd sums of powers”: 

n 

^ ^ ~ ^2m—l •) ^ ~ 1 ... 72 . 

i=l 

This and more general OPWM problem are analytically solved using Fade 
approximations and completely integrable isospectral deformation equations. 
We present here an analytical solution, and we describe fast methods of the 
numerical solution needed for practical applications. 

This work on the Classical PWM has a variety of possible practical ap- 
plications. It is a part of an effort at Polytechnic University conducted by 
Dariusz Czarkowski, Ivan Selesnik and authors of this paper. The main em- 
phasis of that effort is on practical implementation of the PWM solution that 
can be useful in power electronics. See a detailed review in [CCCS]. 

A very interesting feature of the general solution to the PWM problem lies 
in its connection to the classical areas of mathematics - symmetric functions, 
orthogonal polynomials, and the theory of completely integrable systems. The 
most surprising relationship is with the Korteweg-de Vries (KdV) hierarchy 
of infinite dimensional Hamiltonians. We show how the complete solution of 
the PWM problem describes, in fact, the class of all rational solutions of KdV 
equations. 



4 Subsequences of Symmetric Functions and zeros of 
Orthogonal Polynomials 

The problem of finding the set of n elements {xi} with given values of n 
arbitrary symmetric functions {sj} in 2 — 1 . . . n, is in general a very com- 
plicated one because of the nontrivial nature of relations between symmetric 
functions of high degrees. Only in special cases can this problem be reduced to 
an ’’exactly solvable” one. For this one looks at the polynomial -P(x), whose 
roots {xi] are: 

n 

p{^) = - Xi) 

i=\ 

This polynomial P{x) is uniquely identifies the set {xi). The coefficients of 
P{x) are elementary symmetric functions in x^, which have to be determined 
in order to determine the set {xi}. Newton’s formulas for sum of powers of Xi 
show that once one knows all first n symmetric functions 
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n 

2=1 

then one easily finds - by means of Newtons ’s linear recurrences - the elemen- 
tary symmetric functions in Xi - the coefficients of P{x) - consequently finding 
the set {xi} as the set of roots of P{x). 

The derivation of Newton’s recurrences is simple; it requires a look at the 
logarithmic derivative of P{x): 

Expanding the logarithmic derivative at x = oo, we get: 

oo 

E ^m 

1 

m=0 

for Sm = Sr=i Comparing the standard expansion of P{x) at x = oo 

P(x) = x"" + aix"^~^ + . . . + ttn-lX + an 

one gets Newton’s linear recursion relations between and Sj. 

Newton’s identities explicitly are: 

k 

^ ^ ^2 * Q'k — i 

2=1 

Other relations between ai and Sj can be found using the following identity 
(that we will use in later generalizations): 

P(X) = X^ e~ Sm=l rnx/>^ _ 

Very important combinatorial interpretations of the last identity were used 
extensively by MacMahon and others ([M], [Li]): 



PM. 

P(x) 






lAi .2A2 .3^3 ...Ai!.A2!*A3!' 



for all partitions A where 1 • Ai + 2 • A 2 + 3 • A 3 . . . = k. 

What happens, if instead of classical consecutive sets of symmetric func- 
tions one knows only the values of n non-consecutive symmetric functions? 
The problem poised in the PWM method of power electronics looks first at 
n odd power sum symmetric functions S 2 m-i for m = 1 . . . n. The solution of 
that problem also leads to a sequence of linear recurrences, but this time not 
among the invariants themselves but the sequences of polynomials associated 
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with them (these are Pn{x) = P{x) as n varies). This solution is based on 
Fade approximations and orthogonal polynomials. 

We start with the general relation between Fade approximations and the 
generalization of Newton relations between power and elementary symmetric 
functions. Let us look at the Fade approximation of the order (n, d) to the 
series g{x) at x = oo. Here the function g{x) is defined via the generating 
function of an arbitrary sequence S'm- 

E ®® Sm 

mxm ^ 

The definition of the Fade approximation of the order (n, d) to g{x) at (the 
neighborhood of) x = oc is the following - it is a rational function with 

Pn{x) a polynomial of degree n and Qd{x) a polynomial of degree d - such 
that the expansion of matches the expansion of g{x) at x = oc up to 

the maximal order. This means that 

or 

Pn{x) - Qd{x)x^~‘^g(x) = 0{x~‘^~^). 

After taking the logarithmic derivative of this expression, we end up with 
the following representation of the definition of Fade approximation: 

^-TT = ^ ^ogx^-^gix) + 

Pfi ^X 

Now if we write normalized (with the leading coefficient 1) polynomials 
Pn{x) and Qd{x) in terms of their roots: 

n d 

Pn{x) = - Xj); Qd{x) = ff (a? - Vk), 

2=1 k=\ 

we get an identification of symmetric functions in xi and yk with the sequence 

Sm 

m=3 ^ Namely, we get: 

±4-tyi = Sj 

2=1 k=l 



for j = 0 . . . n + d. 

Of course, in the case d = 0 one recovers Newton’s identities. The case 
d = n - the so-called case of the ’’diagonal” Fade approximations - is the 
most interesting case. It is also the case that solves the problem of ’’sums of 
odd powers”. This is how it works. Consider the ’’anti-symmetric” case when 
Vi = —Xi for 2 = 1 . . . n and d = n. In this case S 2 m = 0 for m > 0 and 
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*52771-1 = 252m-i* We thus get Fade approximations of the order (n, n) to the 
following function: 

G{X) = e~^ X^modd rn^ ^ 

The Fade approximants have the property: 

Qn{x) = {-lYPn{-x), 

because G{x) satisfies a functional identity: G{—x) = 1/G{x). 

This gives us our main result: 

Theorem 1. The solution {xi] (z = 1 . . . n) to the problem of ’’odd sums 
of powers”: 

n 

^ S 2 m- 1 , m = l...n 

2=1 

is given by the roots of the numerator Pn{x) = nr=i(^ ~ ^i) ^^le Fade 
approximation problem of order (n, n) to the function 

G{x) = ^ . 



Another way to verify this approximation without specialization from the 
case of general sequence Sm , is simply to take the expansion of the logarithmic 
derivative of oX x = oo. Expanding the logarithmic derivative, and then 

integrating it (formally) in x^ one gets a very simple identity 



(-ir 



Pn{x) 

Pn{-X) 



-2 






From this identity Theorem 1 follows. This is a new formula that Newton 
could have found. 

Proof of Theorem 1. First of all, the Fade approximation rational function 
^4^ of order (n, n) is unique. Then, if 54^ is a Fade approximation of order 
(n, n) to G(x), we assume that this representation of the rational function 
is irreducible (i.e., that Pn{x) and Qn{x) are relatively prime). Then 
is a Fade approximation of order (n,n) to 1/G(x), and is a Fade 

approximation of order (n,n) to G{—x). Because of the functional equation 
C[—x) = 1/G(x), and the uniqueness of the Fade approximations, we get 
. This equation means that Qn{x) = aPn{—x). Moreover, 



Pnjx) 



1 as 



Prr(x) 

since the expansion of G{x) a,t x = oo starts at 1, we have 
X oo. Thus Qn{x) = (—l)'^Pn{—x). Taking into account the ’’main” 
identity 

Pn{x) -2^“ ^ V" xr 

= e Z-^m odd Z_-/i = l * 



(-I)"- 



Pn{-x) 



we can see that the right hand side of this identity and the expansion of G{x) 
at a; = oo has to agree up to (but not including) This means that 
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we have ~ ^ 2 m-i for rn = l...n. Note that in the definition 

of G{x), the values of 52m - 1 for m > n have no impact on the definition of 
Pn{x) because they enter the expansion of G{x) only at x~^ for /c > 2n + 1. 

This is so because 

Since we identified the solution to the ’’sums of odd powers” problem with 
numerator (or denominator) in the (diagonal) Fade approximation problem, 
we infer from the standard theory of continued fraction expansion that ratio- 
nal functions are partial fractions in the continued fraction expansion 

of the generating function G{x) at x = oo. This also means (see [S] for these 
and other facts of the theory of continued fraction expansions and orthogonal 
polynomials) that the sequence of polynomials Pn{x) is the sequence of or- 
thogonal polynomials (with respect to the weight that is Hilbert transform of 
G{z)), and that the sequence of polynomials Pn{x) satisfies three-term linear 
recurrence relation. Since the same recurrence is satisfied both by numera- 
tors and denominators of the partial fractions, it means that the recurrence 
is satisfied by two sequences - Pn{x) and (—1)^ • Pn{—x). With the leading 
coefficient of Pn{x) is 1, one gets a particularly simple three-term recurrence 
relation among Pn{x): 



Pn+l(a:) = X • Pn{x) + Cn ' Pn-l{x) 



for n = 0 . . .. 

5 Algebraic Problem of Odd Sums of Powers vs the 
Transcendental Problem of Sums of Odd Cosines 

The original definition of PWM problem dealt with transcendental equations 
cos (2m — l)ai = C 2 m-i, and not algebraic equations = 

52m- 1 - A very important contribution to the PWM problem by D. Czarkowski 
and I. Selesnick is in the explicit reduction of the transcendental PWM prob- 
lem to the algebraic one for consecutive m = 1, . . . , n. This is what allows us 
here to use the solution of the algebraic problem of ” sums of odd powers” for 
the solution of the original transcendental PWM problem. We will show now 
how the transcendental case is explicitly expressed using the introduced no- 
tations of G{x) and Pn{x). The basic transformation is cosa^ = Xi or in the 
algebraic form: x = {z z~^)/ 2, for z = e^. Now, to get from the algebraic 

’’sums of odd powers” solution to the transcendental one, consider the set of 
roots Zi, Zi~^] z = 1, . . . , n. Then ’’sums of odd powers” for these roots gives 

n 

cos (2m — l)ai for cosa^ = Zi. 

2=1 

This allows us to represent the polynomials Pn{x) in 2 -form. Namely, if we 
define 
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Pa{z) == fl(z- Zi), Pb{z) = f[{z- —), 

i=l i=l 

then we can express Pa(z) • Pb(^) as a polynomial in x = |(^ + 1/^) • Pa(z) • 
Ph{z) = z'^ • Pn(x). This allows us to express the Fade approximation to G{x) 
(and G{x)) as a similar function of We have: 

Pnjx) ^ Pg{z)Pb{z) 

^ ^ Pn{-X) P^(-z)Pi,{-zY 

Expanding each term on the right as a function of z~^, we get 

/jxn . Pnjx) ^ 2 E“=i 

^ Pni-X) 

where Tm = = 2 cosma^. 

This means that the function G{x) (or its Fade approximation) that de- 
termines the solution of the algebraic ’’sums of odd powers” problem (in x) 
can be reduced to the transcendental ’’sums of odd cosines” problem (in z). 
Specifically, G(x), as a function of z has the following form: 

G(X) = e~^ Xlmodd rrxzrr^ ^ 

where the sequence {cm} arises from the following general transcendental 
’’sums of odd cosines” problem 

n 

cos (2m — l)ai = C 2 m-i, 'm = 1, . . . ,n 

i=l 

corresponding to the algebraic ’’sums of odd powers” problem: 

n 

^ = S2m-1, m = 

i=l 

with Xi = cos ai for z = 1, . . . , n. 

Notice that the ’’explicit expression” for Xi {ai) or Pn{x) simply means 
that the continued fraction expansion of G{x) (in x or ^ at infinity) is known 
’’explicitly” , or equivalently that the coefficients Cn in the main three-term re- 
currence describing Pn[x) are ’’explicit” (i.e., classical elementary or transcen- 
dental) functions of n. From this point of view, the main FWM transcendental 
problem 



n n 

cos ai = a, cos (2m — 1)^^ = 0, m = 2, . . . , n 

i=l i=l 

is not ’’explicitly solvable” because the corresponding function G{x) (see the 
expression above in terms of z and the corresponding sequence {cm})- 
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Ga{x) = 

does not have an ’’explicit” continued fraction expansion at x = oo. On the 
other hand, a very similar algebraic problem: 

n n 

2=1 2=1 

does have an ’’explicit” solution, since the corresponding function 
has a classical continued fraction expansion derived by Euler. The polynomials 
Pn{oc) arising in this special algebraic problem are well-known under the name 
of Bessel polynomials. 

An application of the solution of the transcendental ” sums of odd cosines” 
problem in PWM applications lies in is the ability to construct high quality 
digital (step-function) approximations to arbitrary harmonic series. Since the 
proposed algorithm can be executed quite fast on any processor with high- 
performance DSP capabilities, it opens the possibility of better on-the-fly 
construction of arbitrary (analog) waveforms using simple digital logic. 



6 Completely Integrable Difference- Differential 
Equations of Korteweg-de Vries type. 

In connection with Pade approximation solution to the ’’sum of odd pow- 
ers” problem, we can ask what ” nonclassical” objects this solution is built 
from. These objects are ’’soliton equations” and their solutions - isospectral 
deformation equations of the full Korteweg-de Vries (KdV) hierarchy. 

The only parameter left in our solution is the coefficient factor Cn of the 
three-term recurrence for polynomials Pn- 



Pn+l{x) = X ■ Pn{x) + Cn ■ Pn-l(x). (1) 

This parameter Cn is a function of n and the whole generating sequence 
{sm • ^ — odd} of odd symmetric functions. It is Cn that is a solution of the 
KdV type hierarchy of completely integrable p.d.e.s in variables Sm- In addi- 
tion to p.d.e.s in Sm the parameter Cn satisfy difference-differential equations 
in n and each of 

The formal derivation of the full hierarchy of such equations is based on 
our paper [Ch3]. To see simply how one can derive them we can start with 
(1). Think of x as a spectral parameter: x = A. Then (1) is an eigenvalue 
problem for the second order linear difference operator in n. The crucial next 
step is the realization that polynomials Pn also satisfying linear differential 
equations in each of the variables Sm- Once such a differential equation in Sm 
is derived, one looks at the consistency condition of this equation and (1). 
Such a consistency condition is a classical isospectral deformation condition. 
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This consistency condition implies a nonlinear difference-differential equation 
on Cn in n and Sm (for any odd m). The resulting nonlinear equation belongs 
to a completely integrable class. Eliminating n for Sm and Smh one gets a 
KdV type p.d.e. on Cn in Sm and Smt- 

To see how these are derived let us look at the case of m = 1 and the 
variable sq. The corresponding partial differential equation on is: 



^ * Pn,s\ — Pn Pn * Pn — 1 (2) 

where the parameter En does not depend on x. The consistency condition 
between (1) and (2) leads to the following two difference-differential equations 



— En ’ Pn+l? ^n,si — Pn+1 En 



( 3 ) 



We also summarize here all relationships with Toeplitz determinants in 
the expansion 



m=0 

of the approximated generating function of ’’odd sums of powers” {s 2 m-i}* 

G = e”^Smodd 



Specifically we use the standard representation of the main and the aux- 
iliary Toeplitz determinants: 

Dn = det(c^-^J An — det(ci-^_^_f-i)^j^Q. 

The relationships between Cn and the determinants are the following ones: 

The expression of En in the difference-differential equations above is: 

7^2 Pn+1 ' Pn— 1 

^ m — • 

n 

The main object Dn is expressed in terms of the r- function (typical nota- 
tion for KdV equations): 

Dn = 

where is a polynomial in 52m- 1 - We need, however, to normalize the poly- 
nomial Tn,'so that the leading power of 5i (and it is would have a 

coefficient of 1. In this case we can write more accurately: 

Pn ~ ^n * '^n ’ 
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The rational numbers Vn are easily determined using the following expres- 
sions: 

jp 1 '7'n+l ‘ '^n—1 ^ 1 '^n+2 ’ '^n—\ 

Vl ’ “ 4^^^ ‘ r.-r.+i * 

This implies rn+i = -r^/((2n— l)^*rn-i). Substituting these expressions 
in equations (3) one can get the difference-differential equation on in n and 

Si: 

'^n+l,x * '^n — 1 '^n+1 * '^n—l,x ~ (2tI 1) • T^. 



7 Korteweg-de Vries Hierarchy and all that 

Explicit recursion relations defining all commuting (higher) Korteweg-de Vries 
(KdV) flows are well-known. These relations connect the infinite sequence of 
conserved quantities Hm of the original KdV equation - these are the (infinite 
dimensional) Hamiltonians of the higher KdV equations - with the vector 
flows Xm of (m -th) KdV equation. The m-th KdV is: 

Utm = ^m(u); for Xm(u) = 8^ 

SHm/Su is the gradient of the functional Hm of u, e.g., for the first KdV 
Hamiltonian (the actual KdV equation), we gave: 

5 H 2 3 9 1 „ _ __ /*/l 1/ /\‘2^ 

- = -u --U, forH2 = Ji-u +-{u) )dx. 

The flows Xm{u) are commuting (as Poisson structures induced by Hm)- 
Recursion connecting successive Hamiltonians are relatively simple: 

XM = dJJ^ = + + 

One can also write all higher KdV flows Xm in the explicit form using this 
recursion as follows: 



X^{u) = 

using the operator: 

Nu = ~^dl + 2u + uM^- 

The first few higher KdV equations thus are: 

Uti = u'\ 



Ut2 



Zuu' - -u"'; 
2 ’ 
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One should not confuse x variable in KdV equations with x used in the 
’’sums of odd powers” PWM problem. The variable x in the PWM problem 
is the spectral variable, usually denoted as A in 

G = e“^S-odd 

and the polynomials, whose roots solve the PWM problem will be denoted 
as Pn(A). 

Further, the identification of odd moments 52m- 1 with the canonical vari- 
ables of p.d.e.s in the KdV hierarchy, is the following one: 

X = 5i; 

and for the higher fiows ’’time variables” tm we have 

, ^2m— 1 

“ (2m- 1)2'"-! 

(so the standard KdV time is t = 53/6). 



8 Solution to the PWM Problem via KdV rational 
Solutions 

The relationship between the KdV hierarchy of equations and the general 
’’sums of odd powers” problem is very interesting one. Roughly speaking the 
parameter Cn in the recurrence relation defining the orthogonal polynomials 
in the ’’sums of odd powers” problem, £is functions of odd moments 52m- 1 
satisfy all p.d.e.s in the KdV hierarchy, with x = si and higher flows ’’time 
variables” tm being tm = (2m-T)V">-i • 

Moreover - and this is what distinguishes the ’’sums of odd powers” case, 
and completely characterizes it in terms of KdV equations - rational solu- 
tions to the KdV equation and to the full KdV hierarchy are com- 
pletely described by the solution to ’’sums of odd powers” problem. 

The class of all rational solutions to KdV, well-studied since 1977 , and 
still under great deal of investigation today, has many interesting and impor- 
tant properties - this class is a limit case of the famous W-soliton solutions 
corresponding to special rational curves; it has a famous many-particle in- 
terpretation in terms of dynamics of poles of these solutions in x- (and tm-) 
planes, etc. 

The specific KdV relationship is the following. We look, as above, at the 
orthogonal polynomials Pn(A) representing the Fade approximants to the gen- 
erating function G of the sequence of odd moments 52m- 1* 

^ ^rn 

G = e ^modd rnX^ ^ 
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and satisfying the three-term recurrence 

P„+l(A) = A • P„(A) + Cn ■ Pn-l(A). 

’’Explicit” expressions for Pn{^) and C„ involve, as above, Toeplitz deter- 
minants in the coefficients Cm of the expansion of G at A = oo: 



Ga(A) 



oo 

E ^m 
m=0 



Specifically we use the standard representation of the main Toeplitz de- 
terminant (as above): 

Dfi = det(ci-|-j)^j_Q. 

There is also the auxiliary Toeplitz determinant An: 



An — det(Q4-j;_|_i)^ -_Q 



The relationships between Cn and the determinants are the following ones: 



Cn = 



An+l * An—\ 



Al = (-ir2P„-D„+i. 

The main parameter Dn is expressed in terms of the r-function of KdV- 
type equations: 

D ~ 

— ‘n ’ 

where is a polynomial in .S 2 m-i- 

The KdV solutions are expressed in terms of the potential u, very simply 
related to r as follows: 

U„ = -2dl{\ogTn) = -9|(logP>„). 

(Here as above x = si). 

The relationship between KdV hierarchy and the ’’sums of odd powers” 
recurrences is the following one: 

Theorem 2. If the generating sequence of odd moments {52m-i} is con- 
sidered as a sequence of independent variables, then with the identification 
X = si, and tm = ^ functions Tn = are all rational 

solutions of KdV equation (and all commuting higher KdV equations). The 
polynomials are characterized by the their degree in x - which is . 

An important consequence of this theorem is the characterization for the 
first time of the full manifold of rational solutions as explicit functions of 
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actual higher KdV natural parameters tm (see above the identification tm = 
with odd moments variables) . This completes the study of rational 
solutions of KdV that we started more than 20 years ago [Ch4] . 

We write down explicitly the first few r^, normalizing them (for unique- 
ness) with the coefficient at being 1 (remember that x = Si): 

T2 = 5i; 

T3 = sf - S3; 

T 4 = s\ — 5siS 3 + 5s| + 9siss; 

Ts = - 15 siS3 — I 75 S 1 S 3 + 63(5siS 3 + sf - 3 ss)s 5 + 225(ss — sf); 

The recurrence that these normalized polynomials satisfy is a known 
one (it also follows from the difference-differential equation on Cn in n and 
si), that contains a crucial ambiguity, hiding in constants of integration the 
explicit dependency on si: 



'T~n+l,x ' '^n—1 '7”n+l ’ — (2?7- 1) • T^. 

Theorem 2 and the direct relation to the Fade approximation problem to G 
provides a much simpler theory of rational solutions to the KdV hierarchy than 
all other descriptions (we refer to [AMM] and [Ch4] for an original exposition, 
and [Z] for the modern presentation and review). 

An interesting corollary of Theorem 2 is that the polynomials Dn (rn) and 
rational functions Cn are derived by means of successive Backlund transfor- 
mations from the initial n = 0 (i/ == 0) case. 



9 New Difference - Differential Equations In 
Nonintegrable Cases - PWM Examples. 

Whenever the function G that is expanded into its continued fraction, does 
not satisfy a Riccatti equation over C(x), there is no simple Painleve equa- 
tion/recurrence on partial quotients Cn- Most G fall into this category and 
examples of G from PWM problems are not ’’integrable” either. The most 
interesting example of G depends on the parameter a (voltage level): 

0 ^—4a{x — \/x'^ — l) 

expanded at x = oo. The corresponding partial quotient Cn = Cn{ci) has to 
be determined as a function of a in order to compute solutions Pn(x) of the 
main PWM problem. We show that Cn(a), though not a Painleve function, 
satisfies an algebraic difference -differential equation in n and a. 

We start with the definition of Fade approximation to G: 

qn-G -pn = 
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where qn = qn{^)^Pn = Pn{^) are polynomials of degree n in x, and Pn(^) = 
{—l)'^qn{—x). From the expansion of G at x = oo one gets leading coefficients 
of qn- qn = ax'^~^ + q 2 ,nX'^~‘^ + . . . , where g^ 2 ,n+i = 92 , n + Ci, or q 2 ^n = 

Gi. G satisfies a linear p.d.e. over Q(x, a): 

L - Gx = 4aG for L = {x^ — 1) • dx ~ a • x • da- 

We can differentiate the definition of the Fade approximation, and get: 

Qi^^-G-p^ = o{x-n; 

= {x^ - l)qn,x - aXQn,a + 4ag„; = (x^ - l)pn,x - axpn,a- 

Because of orthogonality properties of qn{x), we can express Qn^ as a 
linear combination of only a few of qm^s: 

Ql^'^ = Q„ • qn -1 + 2a ■ qn + n ■ X ■ qn. 

Here we define an expression built from previous Cm- 

n—l n—1 

an = —2 • ^ Gi — a ' Ci^a + 2a? — n. 

i=l i=l 

As a result we have a linear partial difference-differential equation on qn 
in n, a and x. This equation is compatible with the original three-term linear 
recurrence on The consistency condition becomes our new equation on 
Gn(a). To represent this consistency condition in a transparent form we use 2 x 
2 matrices and a vector of two consecutive polynomials and the corresponding 
shift operator -h: 




Then we have the original recurrence written as 
q+ = Mn-q\ Mn= 



We can write the expression of Qn ^ as follows: 



L{q) = Kn'q\ Kn 



nx — 2a an 

an—ljCn—l l)x 2u an—ljCn—1 * X 



The compatibility condition now becomes 



L{Mn) -^Mn^Kn = {Kn)^ * 



Using this definition of an in the first equation we can write a single 
differential equation in a on Cn that involves only Cn-i and Cm for m < n — 1: 
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Cn,a — — ^n-1 ‘ Cji H 



for 



d ^n— 1 



Its solution can be written in quadratures as follows: 



The initial condition on Cn as a function of a is: Cn \a=o= —1/4. 

A more detailed algebraic analysis of Cn as a rational function of a re- 
veals more algebraic structure not dissimilar to that of solutions of Painleve 
equations. The canonical determinant is 

An = det(ci+j+i)”jio 

in G = E“=oCn/a;”. In these notations we have 

^n+l4\n-l 



Cn = — 






Canonical polynomials ujn in a: 

degcjn = (n 4- 1^/2 

for odd n, 



degcjn = 



n{n -h 2) 



for even n. 

In terms of polynomials ojk we get the following expressions 

UJk-3 • 



Ck = rik 



LUk-2 • ^k-l 



COnst^2 * d)n — 1 * d)n — 2 ' d . 

rik 2k 

n/c+i 2A: - 1 



We can write then 






-,2n+4 



(jJy^ 



^n-2 

We can get the following expression for Un (and Cn) in terms of ujm for 
m < n: 



/ 



a 



• a2"+2 . + Const = 

^n—3 ^n—2 



These differential equations or integral representations allow for a simple 
recursive computation of C'n(a) as a rational function from Q(a). 
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10 Complexity of Computations of PWM polynomials 
and solutions 

What is the complexity of computations of PWM polynomials and their roots 
- i.e., of the solution to the classical PWM problem? One can ask the same 
question about all n first sums of powers. If one uses just Newton identities, 
the complexity is O(n^), but a much faster scheme can be found. The key to 
this is: ^ ^ 

P{x) = e~ ^m=l rn^ ^ 

Indeed, according to Brent’s theorem N terms of the power series expansion 
of can be computed in only 0{N log N) steps from the power series 

expansion of V{x) (see [Kl], Sec. 4.7, Ex. 4.) This algorithm requires only use 
of the FFT-technique for computation of fast convolution. 

Similar complexity considerations can be applied to the problem of fast 
computation of polynomials Pn{x) that give the solution to the classical PWM 
problem of consecutive odd power sums. In fact, the arguments presented in 
this and the following sections are applied without significant changes to the 
solution of the general OPWM problem that uses two-point Pade approxi- 
mations. A classical Levinson algorithm of solving systems of Toeplitz linear 
equations, or algorithms based on three-term recurrence relations satisfied by 
Pm{x) give the complexity of computations of (all coefficients of) Pn{x) as 
0{rP). These algorithms are, perhaps, the best in the range of moderate n 
because of their simple nature and the fact that they use almost no additional 
space. Also the 0{rP) complexity algorithm provides with the determination 
of not only the single Pn{^) but all Pm{x) for m < n. For large n these 
algorithms became impractical. Thus one needs to use fast algorithms. 

This is how a fast algorithm of computations of (all coefficients of) Pn{x) 
of the total complexity of 0{n log^ n) works. First, one has to apply Brent’s 
theorem to compute 0{n) terms of the power series expansion (at infinity 
X = oo) of 

— 9 Sm 

G{x) = e 

from the first 0{n) known coefficients Sm with the complexity of only 
0{n log n). Then one has to use fast Pade approximation algorithms (or, equiv- 
alent fast polynomial gcd algorithms). There are a variety of these algorithms, 
with the most popular from [BGY]. Its complexity is 0{n log^n). Thus we 
can compute Pn{x) in at most 0(n log^ n) operations. Of course, this method 
should be used only for a large n (with additional precision of calculations) 
since it relies on a variety of extensions of FFT methods. 
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11 A Simple Algorithm for Orthogonal Polynomials 
Computations 

A simple algorithm of computation of all Pm{x) for all m = 0, . . . , n , having 
the complexity of O(n^), that we mentioned above, is easy to describe. Let us 
look at the expansion of G{x) at x = oo, written in the following form: 

CX) 

G(x) = 

k=0 

By Theorem 1 polynomials Pn{x) are defined from the Fade approximation 
problem to G(x), i.e., the remainder function 

Rn{x) = Pn{-X)^G{X) - 

has the following expansion at x = oo: Rn{x) = Also, since Pn(x) 

are orthogonal polynomials, they satisfy the three-term recurrence 



Pn-^l(x) = X • Prrix) Cri ‘ Pn-l{x) 

for n = 0 — The initial conditions can be chosen here as: P_i = 1, Pq = 1. 
Let us write Pn{x) in terms of its coefficients: Pn{x) = Xir=o * x'^. 

If Pn-i{x) and Pn{x) are known, then in order to determine the single 
unknown and thus Pn-fi(x) (via the recurrence), we have to look at the 
coefficient at x~^ of Rn{x). Assume as before, that the leading coefficient of 
Pn{x) is 1, i.e., Pn,n = L Looking at the coefficient at x~'^ in the expansion 
of Rn{x) we get the following expression for Cn- 



Cn 



E n 

j=0 

E n— 1 

i=0 



Pn,j ' 

Rn—l,i * 



Once Cn is determined, the coefficients Pn-\-i,i ^re easily determined recur- 
sively. 

In order for this algorithm to work, one needs coefficients Ci in the expan- 
sion of G(x). As we mentioned above, the complexity of Brent’s algorithm of 
computations of a up to 0{n) that uses FFT is O(nlogn). 

In fact often the complexity is bounded only by 0{n). For example, in the 
most interesting case of the PWM problem: 



G{x) = Ga{x) = 

the algorithm for computing Ci is a very simple one of complexity 0{n) only. 
This algorithm follows the authors’ general power series algorithms [Ch5]. 
The key to this algorithm is to notice that Ga{x) satisfies second order lin- 
ear differential equation (with singularities at x = — 1, l,oo and an apparent 
singularity at x = 0): 
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y"x{x^ — 1) + y'{8ax{x^ — 1) + 1) + y(4a(l — 4ax)) = 0. 

If we look at the expansion of Ga(x) at x = oo: Ga{x) = 
we get the 4- term recnrrence on c„: 

^«+2 = 8a(n+2) ‘ (cn+i((n + 1)^ + (n + 1) - 16a^) + c„(8nn + 4a) + 
c„_i(-(n - 1)2 - 2(n - 1))). 



The initial conditions are c„ = 0 for n < 0 and 

Co = 1, Cl = 2a, C 2 = 2a^, 

From these values and the recurrence for c„ one derives the C„ factors in the 
three-term recurrence for orthogonal polynomials Pn{x). Here are a few initial 

^ ^ 4a2 - 3 ^ 45 - 60a2 -h 16a^ 

Co - a, Cl - > <-2 - _ 3 ) ■ 

Here all C„ are rational functions in a of rather special structure. Since the 
case of the continued fraction expansion of Ga (x) is not ” explicitly solvable” , 
Cn is not a ’’known” function of a and n. An important consequence of the 
’’unsolvability” of C„ is the growth of coefficients of Gn as rational functions 
in a with integer coefficients. According to standard conjectures about explicit 
and non-explicit continued fraction expansions (see [Chi]), the coefficients of 
Cn as the rational function in a over Z are growing as ^ for large n. In 
fact, for n > 12, the coefficients of in a are large integers. This makes 
it impractical to precompute with full accuracy for large n. It is also 
unnecessary to analytically determine = Gn(a) explicitly, since we need to 
know Cn{a) only in the range of a that is significant for applications - this is 
the range where the weight of orthogonal polynomials Pn{x) is real. 

Let us show explicitly Pq{x) = Pg{x,a) the polynomial: 

(-1099511627776 a^i -h 49478023249920 a^°x -I- 

3809807790243840 a^^a; (-3 -b 4x^) - 272129127874560 a39(-l -p 4x^) - 
714338960670720 a^^ (43 - 328 -b 208 x^) -b 714338960670720 a^^x (1685 - 
4280 a;2 + 1456 a;^) -b 139296097330790400 a^^x (-551 -b 2005x2 - 1316 x"^ -b 
128 X®) - 11608008110899200 a®® (-181 -b 1980x2 - 2400 x^ -b 448 x®) - 
4673280144593651143490437500000 a2 x (3 - 45 x2 -b 180 x"* - 264 x® -b 128 x®) + 
438120013555654794702228515625x(5- 80x2 -b336x^ - 512 x® + 256 x®) - 
438120013555654794702228515625 a (1 - 40x2 + 240 x^ - 448 x® -b 256 x®) -b 
2336640072296825571745218750000 a® (1 - 40 x2 -b 240 x“* - 448 x® + 256 x®) + 
34824024332697600 a®2 x (95319-442680 x2-b420084 x^-82368 x®-bl088 x®)- 
2742391916199936000 a®® X (37688 - 209833x2 -b 255880 x^ - 75296 x® + 
2176 X®) - 17412012166348800 a®® (5605 - 78334x2 -b 136328 x^ - 49952 x® + 
2176x®) -b 311552009639576742899362500000 a^x (123 - 1732x2 -b 6544x‘* - 
9120 X® -b 4224 X®) - 311552009639576742899362500000 a® (17 - 672x2 ^ 
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4000 - 7424 X® + 4224 x®) - 118686479862695902056900000000 a® x (499 - 

6630 x^ + 23560 x^ - 30816 x® + 13376 x®) + 274239191619993600 (11921 - 

199920x2 + 444760 x^ - 238880 x® + 21760 x®) + 

59343239931347951028450000000 (115 - 4444x^ + 25984x^ - 

47552 X® + 26752 X®) + 5656183327162368000 a^®x (424359 

2723796x2 + 4003184 x^ - 1561088 x® + 73984 x®) + 

7912431990846393470460000000 a® X (7341 - 92592x2 + 309536x‘* - 
376640 X® + 150016 x®) - 7912431990846393470460000000 o® (715 - 26788 x2 + 
152384 x”* - 272256 X® + 150016 x®) - 1885394442387456000 a29 (43413 - 
840060x2 + 2243136 x"* - 1565056 x® + 221952 x®) 

1918165331 1 14277204960000000 a^°x (20157 - 242368x2 + 763992x^ - 
860800X® + 308096X®) + 959082665557138602480000000 a“ (3375 - 

121872x2 + 668960x^ - 1155008x® + 616192x®) 

29978652305571121152000000ai^x (211983 - 2320480x2 + 6524340 x"* - 
6264792 X® + 1704864x®) - 4342888260886855680000 a22x (1424718 - 

12271325x2 + 25980972 x^ - 16934880 x® + 2078080 x®) + 

358288281523165593600000 a!® (48491 - 1443298x2 + 6422140 x'* - 

8729812 X® + 3490712 X®) + 11241994614589170432000000 a!® (36587 - 



1210992x2 




6064752 x”! 


9487808 X® + 


4546304 X®) 


— 


2828091663581 184000 a^® x (15025965 


- 108265340x2 


+ 184227288x4 


— 


88762560 X® 


+ 


6125440 X®) + 1414045831790592000 a2!" (1107681 


— 


24079896x2 




74194336x4 - 


62926080 X® + 


12250880 X®) 




371 18703084503040000 x (15659431 


- 124251072x2 


+ 238180224x4 


— 


135547008 x® 


+ 12747008 X®) + 470253369499154841600000 a!® X (3532305 


— 


36755470x2 


+ 


97319348x4 - 


85924160 X® + 


19796160 X®) 





361907355073904640000 a2® (743683 - 19351944x2 + 73808448 x"* - 

82706240 X® + 24936960 x®) - 44786035190395699200000 a!® x (7416360 - 
72992577x2 + 181023556 x'* - 146106480 x® + 27925696 x®) + 

2810498653647292608000000 a!2x (6507637 - 74700980x2 + 222417104x4 - 
231548736 X® + 72980736 x®) - 2810498653647292608000000 a!® (475795 - 
16484016x2 + 86701648x4 - 143211712 x® + 72980736 x®) - 

117563342374788710400000 a!!’ (818493 - 25762098x2 + 122055168x4 - 
178987040 X® + 79184640 x®) - 5302671869214720000 a®® (4380839 - 

104950052x2 + 363264160x4 - 359205952 x® + 89229056 x®) + 

1357152581527142400000 a2®x (37857867 - 350070756x2 ^ 806628912x4 - 
589406592 X® + 91422464 x®) - 271430516305428480000 a2i (9003483 - 
251665140x2+ 1042516880x4 - 1296841600x® + 457112320x®)) / 
(8821612800(12714083695698776015625 - 67808446377060138750000 a® + 
149178582029532305250000 a4 - 179961464035626273000000 a® + 

134554919202394891200000a® - 66992179236769987200000a!® + 

23251106676342504960000 a!2 - 5793671320995317760000 a!4 + 

1055273129097707520000a!® - 141774665486598144000a!® + 

14064801509670912000 a2° - 1023040734363648000 a22 + 

53635589760614400 a^4 - 1963734545203200 a^® + 47436571607040 a^® - 
676457349120 a®® + 4294967296 a®^)) 
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12 Fast Algorithms For Computing Solutions to PWM 
Problems 



We already know that while the slow algorithms of computation of Pn{x) can 
be completed in 0{n?) operations, the fast FFT- algorithms can be completed 
in 0(n log^n) operations. After Pn{x) is determined, we need, in addition, 
determine the set {xi} of all roots of Pn{x). One can use general methods of 
computations of roots of univariant polynomials; that would bring the overall 
minimal complexity higher. We do not need to do it in our case because 
orthogonality properties of Pn{x) allows one to have fast and numerically 
stable methods of computing {xi}. We present one such algorithm, suitable 
for moderate and large range of n. 

Fast polynomial root finding requires first ’’root separation” - determina- 
tion of domains in the complex plane containing only one root. Next, individ- 
ual roots are ’’polished”, i.e., computed with the high accuracy, usually using 
Newton- type iterative algorithms. Both parts of this program of polynomial 
root- finding can be very well accomplished for polynomials Pn(^) in PWM- 
problems relevant for applications. 

In all practically relevant cases of the PWM transcendental ’’sums of odd 
cosines” problem, xi = cosa^ for i = 1, . . . ,n and real a^. This means that 
Xi are real and lie in the interval [—1,1] for all n. Because of the orthogo- 
nality properties of Pn(^), we get the Sturm property: the roots of Pn~i{x) 
separate the roots of Pn{x). This provides a variety of methods of separating 
the roots of any given Pn{x). The simplest method of the overall complexity 
of 0(n^ log^ n), is to compute roots of all Pm{x) for all m = 0, . . . , n recur- 
sively, using roots of the previous polynomial to separate and then accurately 
compute the roots of the next Pm{x) polynomial. Using fast algorithms of 
computations of Pade approximations this method can be improved to the 
overall complexity of only O(nlog^n) operations for computing the set {xi} 
of all roots of a polynomial Pn{x). 

For these fast algorithms we use fast polynomial evaluation: for any given 
set of N points and a polynomial P{X) of degree N in X, one needs at 
most 0{N log^ N) operations to evaluate P{Xi) for all i = l,...,iV. Using 
this technique one can rapidly find true roots Xi of the polynomial P{X)^ 
starting with approximate (but well-separated) values of these roots. In 
the case of single roots (and for all physically relevant PWM problems roots 
are single) one uses a classical Newton-Raphson method. 

For orthogonal polynomials Pn{x)^ arising from the algebraic version of 
PWM problem, we can use fast algorithms of evaluation at n points of Pnipc) 
and Pn{^) to get n Newton-Raphson approximations running at the same 
time: 



P n 



l,...,n. 



Xi — Xi 
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To get rapid (geometric) rate of convergence of this algorithm one needs initial 
conditions of corresponding to centers of intervals separating the roots of 
Pn(x). 

A fast algorithm arises when we start directly from the initial approxima- 
tion to Xi for the given Pn(x). Such approximation can be rigorously derived 
using the classical properties of orthogonal polynomials (see [S]). Roughly, the 
leading term in 1 /n-expansion of (properly ordered) roots X{ of Pn{x) is given 
by asymptotically as follows: 

ni . 

Xi = cosai, Oii ~ ^ 

n 

More accurately, ai are separated by 0(l/n); if these angles are ordered, then 
A/n < — 0 ^ 2+1 < Bln, Thus one can chose Oinje) total starting points 

X in the Newton- Raphson iterations with the property that any real root xi 
is within the distance e/n from at least one starting point x of the iteration. 
In the case of a fixed machine precision the number of iteration is constant, 
providing us with the fast algorithm of computing the set {xi) with at most 
0(n log^n) operations. 



13 Solvable Extensions of the Classical PWM Problem 

A variety of extensions of the ’’odd sums of powers” problem can be solved 
using Fade approximation techniques. For this one uses methods of generalized 
graded Fade approximations developed by authors. Some of these problems 
also arise in practical applications of signal processing (phase unwrapping, 
channel identification, etc.). A particular example of the problem, generalizing 
the ’’odd sums of powers” is the problem where for a given n and N > 1 one 
knows n first consecutive sums of powers of {xi} (i = l,...,n) but every 
A"-th one (i.e., 1,...,A' — 1,A’ + 1,...)* This problem is analytically solved 
using simultaneous Fade approximations to N — 1 functions in a way almost 
identical to the one presented above for A" = 2. 



14 The complete solution to the OPWM Problem 

The main ingredients of the solution to the OFWM problem are: 

1) Analytic solution of the underlying approximation problem by reduction 
to the problem of two-point Fade approximation and orthogonal polynomials 
on the unit circle; 

2) Determination of the variational part of the OFWM problem (the de- 
pendency on the amplitudes of different frequency bands) in terms of the com- 
pletely integrable commuting Hamiltonian flows of Sin-Gordon and Korteweg- 
de Vries type. 
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14.1 The analytic approach to the OPWM Problem. 

Let us look at the periodic ±1 pulse train f{t) with the 2N pulses in the 
fundamental period [0,27 t] and the corresponding pulse edges at aj: 

0 = ao < ai < a2 < ■ ■ ■ < <a2Af+i < ch2N+2 = 27t 

(normalized at /(O) = -1,/(27 t) = 1). Let us look at the Fourier expansion 
of fit): 

OO 

fit) = bo + ^ an sin nt -h bn cos nt. 

n=l 

Let us define now the (complex) Fourier coefficient c„ of fit): 

1 

Cn bn + i- an, Cn = ~ /(t)e""‘dt 

^ Jo 

for n > 0. Then the Fourier coefficients in the expansion of pulse train f{t) 
have the following form: 



N N + 1 

7t6o = Ot2j-l + 7T; 

j=l j=l 



N N4-1 

TTCn = — ^ + 1}; 

” f=i i=i 

for n > 0. 

To put this expansion in a more symmetric form, we associate points Zj 
on the unit circle with the phases (angles) aj: 



Zj = for j 2N + 1. 



This allows us to represent the Fourier coefficients 6 q, Cn of / (t) in the invariant 



way in terms of zj: 



—e 



—inbo 



llj=l -2j-l 

rr^ - 



—nni 

2 



Cn T 1 



N+1 N 

j = l j = l 



(for n > 0). 

Finally we can define the main objects associated with the OPWM prob- 
lem. Theses are the polynomials whose roots are positive 

(respectively, negative) edges of the pulses in f{t): 
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N+l 

Qn+i{z) = 

J=1 

N 

Pn{z) = JJ(Z - Z2j). 

j = l 

From these polynomials a rational 2-point approximation z = 0 

and at 2 : = oo) is formed. The following expansions of this rational function 
is crucial for spectral and orthogonal polynomial interpretation: 

Qn-^i{z) -iM^ 

_ , ■ = z ' e ^M=i Mz^ 8itz = oo. 

Pn{z) 

Here we define the sequence of complex numbers {sm} in terms of the Fourier 
expansion {cm}- 



sm = — ^ — Cm + 1 forM = 1 ,.... 

If we denote by P—(z) the reciprocal of a polynomial P{z) (with a leading 
coefficient one), then we have the second expansion formula of the rational 
approximation : 



= z • e Z-^M=i Mz^ a,t z = 00 . 

Pn(^) 

Here s*j^ is the complex conjugate to sm- These expansions follow from New- 
ton’s original formula of the sums of powers symmetric functions, and repre- 
sent the expansions at 2 : = 0 and at z = cxd of the rational function 
One has only to add to it the definition of bo in terms of constant coefficients 
of the polynomials Qj\[^i{z), Pn{z) (given above): 

^0 —iirbo 

Po 

where qo,Po are, respectively, constant coefficients of Qn-^i{z)^ Pn{z). 

This set of (relatively) classical identities allow us to represent the general 
OPWM problem of representation of an arbitrary harmonic waveform ^(^) in 
terms of the (periodic) ±1 pulse train as the problem of 2-point Fade approx- 
imation (or polynomials, orthogonal on the unit circle). Namely, for harmonic 
polynomial with 2N + 1 arbitrary coefficients Bj^j = 0, . . . , N”; An^j = 
1,...,N: 

N 

= Bo E An sin nt + Bn cos nt. 

n=l 

we have the theorem characterizing the ±1 pulse trains with edges aj that 
have the same leading 2^^ -h 1 coefficients in the Fourier expansion: 
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Theorem. In order for the periodic pulse f{t), as above, to be a harmonic 
approximation to the waveform to within the first 2N+1 harmonic terms, 
it is necessary and sufficient that the rational function 

Qn+i(z) 

Pn{z) 



with Qn+i{z) = - Z 2 j-i); Pn{z) = be a solution 

to the following 2-point Fade approximation problem: 

■ = 2 • foo{z) + 0{z~^) at 2 = 00 

Pn\z) 

where the functions foo{z), fo{z) are ’’conjugate” generating functions of 
Fourier coefficients of 4’{t): 

foo{z) = e ^M = l Mz^ ; 

^ —Mni , ^ . 4 X , .r 

Sm = — 2 — M = 

fo{z) = e~^M=o ^ ; 

Miri 

5* = -TTZ^O, SIj = - i • ^m) + 1, M = 1, . . . . 

This particular class of Fade approximations is closely related to the clas- 
sical class of orthogonal polynomials - polynomials orthogonal on the circle, 
studied by Szego and others. They posses many important properties, one of 
which is of particular use in computations is the existence of the three-term 
recurrence relation, connecting the polynomials Qm-\-i{z), Pn{z) for consecu- 
tive N. 

This naturally assumes that we have a ’’full” harmonic expansion of the 
basic waveform '0(t), looking at its first 2N-\-l coefficients, defining the ’’pulse 
train approximation of the order AT” - f{t) with N 1 up-ticks a 2 j-i, J = 
1, . . . , AT -1-1. 

The three-term recurrences satisfied by the polynomials Qn-\-i{z) for N = 
0, . . and by the polynomials Pn{z) for N = 0, ... is the following one familiar 
from the theory of polynomials orthogonal on the unit circle: 



= {z Dn) • X]\j -h 2: • Ejsf • X]sf-i 

(satisfied by = Qn-^i{z) and hy Xf^ = Pn{z)). An important interpre- 
tation of this three-term recurrence is the difference spectral problem of the 
second order (on X^r), where z plays a role of the spectral parameter (in 
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the complex plane) . Such interpretation allows us to use a Riemann boundary 
value problem and after the reformulation of the Fade approximation problem 
as the matrix (here, 2x2) factorization problem, to get new numerical meth- 
ods of solving it. We can determine the polynomials Qa^+iC^), Pn{^) (staring 
from the original Fourier coefficients Cm of '0(0) only 0(N • log A^) opera- 
tions. The roots zj (and thus whenever they are real) can be determined 
in 0{N • log^ N) operations. 

We present several simple examples of OPWM approximations to trigno- 
metric polynomials. Graphs show the trigonometric polynomial and the 
corresponding pulse train f{t) on [0,27 t]. 




Fig. 1. jp{t) = 1/2 4- (3sin(^))/(27r), f(t), N = 24 



14.2 Soliton solution of the OPWM Problem 

The most interesting, at least in the mathematical sense, thing about the 
OPWM solution is its deep relationship to completely integrable Hamiltonian 
(classical and quantum) systems of the isospectral origin. This relationship 
provides a different characterization of pulse edges in terms of the discrete 
spectrum information in the scattering matrix, associated with soliton type 
solutions. The appearance of an infinite family of commuting Hamiltonian 
flows (representing first integrals of the completely integrable Hamiltonian) is 
natural, because the OPWM problem depends on Fourier coefficients a^, bi as 
independent variables (representing commuting time flows). 
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The first example of such a ’’soliton” representation of OPWM solution 
that we studied arose from the hierarchy of Korteweg- de Vries equations. The 
KdV case arises from the quarter period waveforms that have odd sine terms 
only: 

2N-1 

-tpit) = E a-n sin(n^) 

n=l,n— odd 

The corresponding moment problem in the algebraic form is a problem of 
’’the sum of odd powers” 

2m-l 

y~^ S2m-1 :m=l,...,N. 

2=1 

The identification of time variables tm of m-th order KdV Hamiltonians 
Hm with free parameters is relatively simple 



- '52m— 1 rj 

trrr , , ^ . TTl/ 1. . ^ . 



where t\ = x is the spatial variable of the KdV equation. 

The fact that the variation of the PWM problem in this case is subject to 
KdV equations, allows us to derive various approximations to specific classes 
of PWM expansions. For example, in the most important PWM case of a 
pure sine wave of an arbitrary amplitude ai, one gets a difference differential 
equation in n and ai, that can be used for an efficient and rapid numerical 
integration. This and other consequences of solition studies allow for fast 
and stable numerical methods of computation of pulse edges ai in the PWM 
problem even for very large values of N. 

Without this additional information the case of large values of N had 
been considered impractical to study using conventional techniques of numer- 
ical solutions of a system of N simultaneous transcendental equations in N 
unknowns ai. The reasons for the difficulties include: ill conditioning of the 
Jacobian matrices in the neighborhood of a generic point in the phase space 
(these matrices are of Hankel type and resemble famous Hilbert matrices). 
Straightforward applications of conventional computations of Pade approxi- 
mants is also unfeasible for large V, due to the presence of a very large number 
of singular values in the corresponding Hankel matrices. Only because of the 
control of commuting difference and differential operators, whose eigenfunc- 
tions are the polynomials Pn{^) in PWM problem, a stable determination of 
polynomials Pn{^) and their zeroes Xi = cosa^ is possible for moderate and 
large N. 
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15 The Boundary Value Riemann Problem 

The key for the solution of the general Fade approximation problem, as well 
as to the problem of its isospectral deformation is provided by boundary value 
problems, and by integral equations associated with them. 

A classical complex variables boundary value problem is called the Rie- 
mann problem. It can be formulated in the following way. Let T be a closed 
contour (possibly, containing oo) in the complex plane A and let G(A) be a 
matrix function (of order m) defined over F. We need to find a matrix func- 
tion '0i(A) analytic inside the contour, and a matrix function ' 02 (A) analytic 
outside the contour, and and -02 on the contour satisfy the following con- 
dition 



'0102 = G(A). 

Solution of a regular Riemann can be reduced to solution of a system of 
singular integral equations on the contour F. Let the Riemann problem be 
normalized at a point Aq e.g. outside the contour F and let 



02(Ao) == p. 

We are looking for the solution of the Riemann problem in the form 



inside the contour, and 



outside the contour. Then the normalization condition yields 




Thus we arrive at a singular integral equation on for A G T 



» - 01 ^^ + 7rzv?(A)T(A) + ^ = 0, 

where T = . The solution of this integral equation solves the (regular) 

Riemann problem. 
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16 Isospectral Deformation Equations 

The appropriate isospectral deformation equations arising from the general 
OPWM problem is the same one that appears in the chiral a - model, and, 
in particular, in Sin - Gordon equation. These equations like other isospectral 
deformation equations arise from the consistency conditions between (matrix) 
differential operators, depending on a parameter A (typically called a spectral 
parameter). The general form of these consistency type differential spectral 
problems is 



- t/(A))v = 0, 

(^-K(A)V = 0. 

where u — f/(A), V — 1^(A) are m x m matrices, rationally depending on 
the (spectral) parameter A. The consistency condition for these equations is 
known as a ”Lax” pair and is 

This is actually a system of nonlinear partial differential equations on 
residues of U{X), their poles independent of r/, 
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1 Introduction. 

This paper deals with a very important practical problem of constructing the 
"best’' spline approximation to a curve or a surface. Needless to say, this prob- 
lem has many practical application, especially in cases of data modeling and 
data fitting. In this paper we are dealing with curves, in one- and multi- di- 
mensional cases, and, in particular, with closed curves. The main methods of 
this paper are methods of (generalized) Pade approximations and orthogonal 
polynomials on the interval and the unit circle, and related methods of me- 
chanical quadrature formulas. For references to the approaches to the spline 
reconstruction of a data curve, where a variety of ’’fitting” figures of merit is 
used, see [KH], [MW], [MP], [W], [M95], [C95], [M97], [MM], [LM94], [Z]. 

The basic approach we use in this paper is that of proper ’’spectral” match 
of the spline approximation to the original data curve. This corresponds to 
the assumption of perceived similarity between objects based on matching 
according to a certain band-limited frequency characteristic curve. This as- 
sumption is used to justify the use of band-limited information extracted 
from the original data curve to match the (polynomial) spline approxima- 
tion to such a curve. Even in the simplest (but the most important) case 
of matching the data curve with piecewise-constant approximation the con- 
cept of band-limited approximation leads to interesting generic observations 
about the number of independent degrees of freedom of the original data 
that can be captured in the approximations. The simplest data fit is that of 
fixed locations of N knots of the spline (sample points) where the values can 
be arbitrary numbers matching N (real) spectral coefficients of the original 
data. This subject is well covered by Nyquist theorem and related studies of 
band-limited functions sampled, typically, at uniformly distributed N points. 
On the other hand, we have recently studied so-called PWM approximations, 
where the sample points are arbitrary (in a given interval of observations) but 
the values are limited only to 2 - we approximate data with pulse trains of N 
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arbitrary width but equal height pulses whose spectral characteristics match 
N (real) spectral characteristics of the original data (see, e.g. [Ch99]). In this 
paper we consider the case of variable locations of sample points (spline knots) 
and values of approximating function. This gives the total of 27V degrees of 
freedom for TV knots (sample points) - ’’twice the Nyquist limit”. 

The proposed method uses an analytic construction of the spline curve 
best approximating the data curve by matching the maximum number of 
Fourier coefficients (trigonometric moments). It is based on technique of gen- 
eralized (two-point) Fade approximations and orthogonal polynomials on the 
unit circle, and is very similar to the PWM technique from [Ch99]. 

We also describe and generalize a somewhat related approach of construct- 
ing splines matching the monomial moments of the approximated function on 
the interval studied in a series of papers starting from [G84]. In the case of 
an approximation of a single function f{t) on intervals [0, oo) and [a, 6] by 
splines that match the maximum number of polynomial moments on inter- 
vals, the reduction to the classical (Gauss) problem of mechanical quadratures 
was obtained by Gautschi [G84], Gautschi, Milovanovic, and Frontini [F87], 
Gautschi, Milovanovic [G86] (see also Micchelli [M88] and related work in 
[MK88], [MOO]). Precise formulations of these results are presented below. An 
important and related problem of periodic spline approximations was left open 
in publications quoted above, and is solved. 

A spline function of degree m > 0, with n distinct knots ti,T 2 , . . . ,Tn in 
the interior of [0, 1], can be written in terms of truncated powers in the form 

n 

^n,m — Prn T ^ ^ ^ ^ — I^ 

1^=1 

where are real numbers and pm is a polynomial of degree < m. 

Problem. Determine spline Sn,m such that 

[ t^Sn,m{t)dt = [ 

Jo Jo 

holds for j = 0, 1, . . . , 2?7 — 1 and 

Here it is assumed that / has m derivatives at ^ = 1, all being known. 

Define 



H 



(m-l- j.+ 1)! 



m\j\ 



m 

/ 

k=0 



/«(!) 

A:! 



{t - l)'^]dt, 



for j = 0, 1, . . . , 2n — 1. This gives rise to linear functional £ on polynomials 
of the form 6 P 2 n-i (P<j is the set of polynomials in t of degree 

at most d), defined by its values on monomials: 
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£(t-+i -iJ) = j = 0,l,-,2n-l, 
and the inner product 

< p,q>= -q), P-Q & P2n-i- 

The orthogonal polynomials is defined by 



deg 7Tn = n, 7Tn = + . . . , 

< TTn, g > = 0 for all ^ G Pn-I- 
Theorem. There exists a unique spline function on [0.1], 

n 

Sn,m = Pm{t) + a^{Tu ~ i)+ , 0 < Tj. < 1, ^ for V jl, 

satisfying moment equations of Problem above, if and only if the orthogo- 
nal polynomials 7Tn above exist, are unique, and have n distinct real zeroes 
If =z 1,2, ... ,n, all contained in an open interval (0, 1). The knots of 
Sn,m are then precisely these zeroes, 

'Ti. = = 1,2, . . . ,n; 

the polynomial Prn is given by 



Pm{t) = 



m 



E 



A:! 






and the coefficients are obtained uniquely from the linear system 
£o(i™+V) = £(i’"+'p) forallp€ P„-i, 



where 

n 

^o{g) = = 

U=1 

More complicated spline approximations where degrees of piecewise ap- 
proximations vary among the knots also can be described by generalized or- 
thogonal polynomials and Turan’s quadrature formulas (see [MOO]). 

We generalize these and other similar results below for the case of ap- 
proximating spatial d-dimensional curves g(t) by spatial splines defined on 
the interval, that match the maximum (allowed) number of moments on that 
interval. For this we use the technique of Fade approximations of the Second 
Kind and related (multi-index) orthogonal polynomials of the Second Kind. 

The main problem with the moment-matching (moment-preserving) spline 
approximations lies with the instabilities in moment problem calculations in 
cases of moments on infinite and finite intervals. Some of these problems are 
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related to ill-conditioning of related Hankel matrices, and some problems are 
related to singularities arising from non-normals cases, and cases when roots 
are not real or lie off the interval that supports the measure. 

The trigonometric moments, and related polynomials with zeroes on the 
unit circle do not have such instabilities; they are more advantageous for con- 
struction of spline approximations. They also provide a unique framework for 
the construction of periodic splines approximating periodic functions (e.g., 
closed planar or spatial curves). Our results solve the problem of approxi- 
mating periodic functions (curves) by periodic splines, left open in previous 
studies. The technique here differs considerable from the traditional approach 
of Gauss’s mechanical quadratures (and their generalizations), and involves 
two-point Fade approximations (at 2 = 0 and 2 : = cxd) and quadratures at the 
unit circle. 

Unlike the corresponding figure of merit based on matching of moments 
on the interval, figure of merits based on matching of trigonometric moments 
always has ” the best” solution (with the minimal number of knots) . This solu- 
tion is quite practical, fast, and also matches correctly piecewise-polynomial 
input data. The solution is easily generalized to the case of (closed) multi- 
dimensional curves. 

There is no obvious generalization of this method to the surfaces and 
other multi- dimensional manifolds due to the absence of any useful general 
multi-dimensional cubature formulas and related multi-dimensional orthog- 
onal polynomials. Though there are some specific cubature formulas (like 
Radon’s two-dimensional theorem), they seem to work for very specific num- 
bers of knots, and cannot be used with, say, adding one extra knot at a time. 
Nevertheless, one can use our approach with the curves on multi- dimensional 
manifolds to find the best knots on the ” scan-curves” of the manifolds, con- 
structing this way multi-dimensional spline patches. 



2 Multi-dimensional Spline Curves and Fade 
Appr oximat ions . 

2.1 Simultaneous Fade Approximations of the Second Kind. 

To deal with the problem of analytic construction of d-dimensional spline 
curves of arbitrary degree m, we apply the methods of simultaneous Fade 
approximations of the Second Kind, see [Ch85]. Just as the ordinary Fade 
approximants are expressed in terms of (general) orthogonal polynomials, si- 
multaneous Fade approximants of the Second Kind are also expressed in terms 
of simultaneous orthogonal polynomials (or a solution to a multiple moments 
problem). Here are the alternative definitions. 

Definition of One-Point Fade approximations of the Second 
Kind. Let fi{z) . . . fd{z) be a system of functions defined by their Taylor 
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expansions at z = oo. Let n = (^i, • • . , be a multi-index with non- 
negative rii^ i = Then the polynomial Pn{z) in z of degree (at 

most) |n| = ni + . . . + is called a Fade approximant of the Second 
Kind to at ^ = cxD of the weight n if there are polynomials 

Qi{z) i = ^ d in z of degree at most |n| such that the following approxi- 

mation holds. 

The Laurent expansion at z = cxd of Qi{z)/ P{z) coincides with the Taylor 
expansion of fi{z) up to (and including) the term for alH = 1, . . . , d. 

An alternative definition of these approximants involve orthogonal poly- 
nomials of the Second Kind. Measures defining these polynomials can have 
support on unions of arbitrary arcs in the complex plane. However, for a typ- 
ical application that we study here, the support of all d measures is within 
the same interval [a, 6], that can be chosen, without any loss of generality, to 
be [0, 1]. We consider d measures diii{t) on [0, 1], where often these measures 
are defined by means of weight functions Wi{t)dt (for i = l,...,d). These 
measures produce the simultaneous problem of moments and the class of or- 
thogonal polynomials of the Second Kind on the interval, as in the following 
definition. 

Definition of Orthogonal Polynomials of the Second Kind. Let 



diii{t) be d measures on the interval [0, 1] (i = 1, . . . , d). Let n = (ni, . . . , n^) 
be a multi-index with non-negative rii.,i = 1, . . . , d. Then the polynomial Pn{t) 
in t of degree (at most) |n| = rii rid is called an orthogonal polynomial 

of the Second Kind with respect to measures d/j.i(t), ... , dfXd{t) on [0, 1] of 
the weight n if the following orthogonality relations hold. 

/ P Pxv{t) d^J.^{t) = 0, j = 0, . . . , ni 
do 



for ^ = 1 , . . . , d. 

The relationship between the (one-point) Fade approximants and orthog- 
onal polynomials of the Second Kind is fairly straightforward, just as in the 
classical case of d = 1 of the classical moment problem. For this one converts 
the (measure) moments into their generating function, expanded at 2 : = oo, 
as follows. 

For i = 1, . . . , d, we define the generating function fi(z) in terms of the 
moments of the measure d/di{t) on [0, 1]: 



r / \ 

= L 

k=0 

where 

rrik,i = [ t^dfii{t), k = 0, 

do 

In these notations the problem of constructing simultaneous orthogonal 
polynomials and (one-point) Fade approximants of the Second Kind for the 
multi-index n are equivalent. 
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The construction of the polynomial Pn is reduced to the problem of lin- 
ear algebra via a standard Gramm-Schmidt reduction, and an inversion of 
the (generalized) Hankel matrix. There are two important conditions on the 
measures that are often imposed. The first one is the ’’normality condition” 
of the Fade approximations. This condition means that the polynomial Pn is 
unique (up to a multiplicative constant factor), and has the degree exactly |n| 
for all multi-indices n. This condition is equivalent to the condition of non- 
singularity of all corresponding (generalized) Hankel matrices. The ’’normality 
condition” holds generically, and its violation means that there are non-trivial 
relations between the approximating functions. The simplest necessary condi- 
tion for ’’normality” is the linear independence of functions fi{z), i = 1, . . . , d 
over C{z) (or H{z)). 

The second condition concerns the distribution of zeros of orthogonal poly- 
nomials. Under very mild conditions on the behavior of functions fi{z) (re- 
lated to their singularities), the distribution of zeros of polynomials Pn(t) 
approaches the union of supports of measures dfii{t). Stronger conditions en- 
sure that all zeros of polynomials Pn{t) are simple and are located in the 
interval [0, 1]. For d = 1 one such condition is the condition that the weight 
wi{t) does not change sign on the interval [0, 1]. For d > 1 the conditions on 
weights Wi{t) are stronger, of the form of the Chebyshev system. 

Since the construction of orthogonal polynomials implies the solution of 
linear equations, these polynomials can be expressed in terms of determinants 
of generalized Hankel matrices. Assuming normality, the monic orthogonal 
polynomial Pn{t) corresponding to a multi-index n = (ni, . . . ,Ud) of degree 
|n| can be expressed as a ratio of two determinants: 



/ rno,i mi,i 



^|n|,l \ 



Pn{t) 



1 



^ni — 1,1 ^ni,l 

^l,<i 



^|n|-f ni — 1,1 
^|n|,d 



I '^nd — l,d '^ridid 

\ 1 t 



^|n|+nd-l,d I 

) 



where 
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mo,i 


mi^i 


^|n| — 1,1 ^ 


^ni — 1,1 


^ni ,1 


. . . 2,1 


mo,d 


mi^d 


. . . ‘^|n| — l,d 


\ '^rid — l,d 


'^rid,d 


. . . ni|n|-l-nd— 2,d / 



This representation, if used directly, often leeds to ill-conditioned calcula- 
tions even for moderate n. Faster, and more stable, algorithms are sometimes 
based on recurrence relations expressing Pn{t)- In case d = 1 the relation is 
a simple three- term linear recurrence, connecting 3 nearby monic orthogonal 
polynomials: 



Pn(t) = {t bn) • Pn-l{t) ~ Cn * Pn- 2 {t) , 



n = 2, . . . . 

This three-term recurrence also allows to reduce the problem of finding 
zeros of ordinary orthogonal polynomials to the problem of determining eigen- 
values of the three-diagonal matrices (formed from the three-term recurrence 
relation). 

In the case of d > 1, we have linear recurrences of the order d+ 1 (of length 
d+2) connecting orthogonal polynomials of the Second Kind. The easiest way 
to express these relations is to look only at diagonal or near-diagonal cases of 
simultaneous approximations. Namely for every N, we create the multi-index 
n(AT) = (^+ 1, . . . , g + 1, g, . . . , g) (g + 1 is repeated r times, and q is repeated 
d — r times) for N = q • d -\- r. Then orthogonal polynomials of the Second 
Kind corresponding to multi-indices n{N) satisfy the recurrence of order d+1: 

d -\- 1 

Pn{N) = t'Pn(N-l) + bk ' Pn(N-k)^ 

k=l 

TV = d+ 1,.... 

2.2 Multi-dimensional Spline Curves and Moment Problems. 

In order to find an analytic solution to the problem of construction of multi- 
dimensional spline curves using moment approximation we use the method of 
simultaneous Fade approximations from the previous subsection. 

Here we use as a figure of merit of the approximation of the multi- dimen- 
sional curve g(t) = . . . ,gd{t)) for t in [0, 1] by a spline of degree m a 

coincidence of maximum allowed number of moments in t. Here d is the di- 
mension of the Euclidian space R^, where the approximated curve g{t) lies. In 
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order to avoid trivial cases, that will result in non-” normality” of Fade approx- 
imations one should assume that the curve g{t) is essentially d-dimensional, 
e.g. functions , gd(t)) are linearly dependent over R. The minimal 

assumption on g(t) is the integrability of gi{t) on [0, 1]. 

We are looking at a d-dimensional spline of degree m sp{t) defined on [0, 1] 
that approximates g(t) in the sense of matching the highest (expected) number 
of moments on [0, 1]. Specifically, for a multi-index n = (ni, . . . , Ud), we are 
looking at the d-dimensional spline of degree m sp{t) = (spi{t ), . . . , spd{t)) 
defined on [0, 1] with N = |n| knots U, i = 1, ... ,A/" in [0, 1] (0 < < 

... < In < 1), matching the moments with g(t) on [0,1]. This matching 
can be formulated in one of the two following statements (as in the one- 
dimensional case): 

Problem Ml. Determine the A^-knot spline sp{t) = (spi(t), . . . , spd{t)) 
of degree m, such that 

f • spi{t) dt = f ‘ gi{t) dt, k = 0, N -h rii m, 

Jo Jo 

for i = 1 , . . . d. 

The second problem assumes that g(t) has the first m known derivatives 
at t = 1, and we are trying to match these derivatives (at t = 1), in addition 
to the maximal number of moments: 

Problem M2. Determine the Wknot spline sp(t) = (spi(t), . . . , spd{t)) 
of degree m, such that 

f • spi{t) dt = f ' gi{t) dt, k = 0, . . . , TV -h — 1, 

Jo Jo 

and 

spf\t) = A; = 0. ... ,m, 

for i = 1, . . . d. 

Any d-dimensional spline of degree m sp(t) = (spi{t ), . . . , spd{t)) defined 
on [0, 1] with N = |n| knots tj, j = 1, ... ,N can be represented in the 
following form: 

N 

spi{t) = Pi{t) + 'Yh ~ *)+ ’ 

j=i 

where Pi{t) are polynomials of degree at most m, for i — 1, . . . , d. 

Problems M1-M2 of optimal construction of spatial splines sp(t), approx- 
imating original spatial curve g(t) can be reduced to the problem of finding 
zeros of orthogonal polynomials of the Second Kind and related (spatial) me- 
chanical quadrature. The corresponding moment problems and functions that 
are Fade approximated arise from (m + l)-st differentiation, when g(t) is, 
essentially, approximated by a linear combination of delta- functions S(t — tj), 
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centered at the splines’s knots (zeros of orthogonal polynomials of the Second 
Kind). This reduction is very similar to the one-dimensional case, and involves 
mainly integration by parts. 

We’ll present the d-dimensional moment problems equivalent to Problems 
M1-M2. These moment problems can be actually reduced to d-dimensional 
moment problems with d weights on [0,1], provided that g(^) if (m + 1)- 
differentiable: G [0, 1], z = l,...,d. 

First, the case of the Problem Ml. The corresponding d sequences of mo- 
ments k = 0, . . . are: 



^k,i = 



(m -f /c -f 1)! 
mlkl 



■f 



t^gi{t) d^, /c = 0, . . . , 



for i = 1,. . . ,d. As usual, these moments extend by linearity to linear func- 
tionals Cj defined on the set P of polynomials q{t) in t using the following 
definition of C] on monomials 

= mli, k = 0,..., 



for i = 1,. . . , d. These linear functionals are extended to the scalar product, 
defined for all polynomials p{t) and q{t) from P: 

<p,q>l= CU{l-tr+^ -Pit)- q{t)). 



i = 1, . . . , d. 

The orthogonal polynomials of the Second Kind of the multi-index n corre- 
sponding to scalar products <.,.>|,i = l,...,d have zeros that correspond 
to knots ^ 1 , . . . , for N = |n|. To determine the parameters j = 1, . . . , A” 
and coefficients of polynomials Pi{t) one needs to introduce other linear func- 
tionals, arising from m + 1-st differentiation of gi{t): 

m N 

k=Q j=l 



where ^ 

bk,i = k = 

for i = 1, . . . , d. 

Theorem There exists a unique d-dimensional spline approximation sp(^), 
solving the problem Ml for a given multi-index n = (ni, . . . , n^) iff there exists 
a unique orthogonal polynomial of the Second Kind Pn{t) such that 

Pni q 0 ? 

for all polynomials q{t) of degree < rii for all i = 1, . . . ,d having N = |n| 
real zeros tj^ j = 1,..., A in the interval (0,1). The parameters and 
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polynomials Pi (^) (defined by their expansion coefficients bk,i) are determined 
from the block- Vandermonde equations 



= Cl(q) 

for all polynomials q{t) of degree < N -j- rii and alH = 1, . . . , d. 
The proof follows from integration by parts that gives: 




klml 



(m-h k + 1)! 



7Tl-\-k-\-Y 



f 






k\m\ 



{m-\- k -\- 1)! 






d- 



rm — / 






^m+l+/c I 



and thus: 

• i’") = k = 0 > • • • . 

for i = 1, . . . ,d. 

Provided that ^{t) is (m + l)-differentiable: gi{t) e [0, 1], i = 

one can replace the general moment problem for m\ ■ by a prob- 
lem, where the measure djii{t) is a weight: 



dgi{t) = 



m.\ 



(t) dt^ 



for i = 1, . . . ,d. This means that the inner product < >1 can be repre- 
sented as: ^ 

<p,q>l= / p{t) ■ q{t)diM{t), 

Jo 

i = 1, . . . , d. 

Then the unique solution to the d-dimensional spline problem Ml exists for 
a given multi-index n if and only if the corresponding orthogonal polynomial 
of the Second Kind Pn{t) is unique of degree V = |n| (this is true under the 
normality assumption), and its N roots are all real and lie in the interval 
(0, 1). Assumptions of normality of the function system g(t) are necessary for 
this, and in the case of d = 1 the positivity of on [0, 1] is a sufficient 

condition. It is satisfied when, for example, g\{t) is a monotonic function on 

[ 0 , 1 ]. 

The solution to the problem M2 is very similar, but one introduces slightly 
different moments and related linear functionals. Also the definition of the 
polynomials Pi{t), denoted as p^(t) is very simple in terms of values of deriva- 
tives of gi{t) oX t = 1: 



3=0 d- 

for i = 1, . . . ,d. The moments {m\ in the Problem M2 are: 
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'^Ia 



(m + A: + 1)! 

m\k\ 



nl rn 

■^0 J=0 



5 ^( 1 ) 



) dt, k = 0 , . . . , 



and these moments extend by linearity to linear functionals defined on the 

set P of polynomials q{t) in t using the following definition of on monomials 









fc = 0,..., 



for z = 1, . . . , d. These linear functionals are extended to the scalar product, 
defined for all polynomials p{t) and q{t) from P: 



<P,Q>i= ■ q{t)). 



i = 1, . . . , d. To determine the parameters aj^i, j = 1 , . . . , one needs to 
introduce other linear functionals, arising from m + 1 -st differentiation of 
9i{t)' 

N 

^0,iW = H 

i = T, . . . , d. 

Theorem There exists a unique d-dimensional spline approximation sp(A), 
solving the problem M2 for a given multi-index n = (ni, . . . , rid) iff there exists 
a unique orthogonal polynomial of the Second Kind Pn{t) such that 

Pm Q — O 5 

for all polynomials q(t) of degree < for all i = 1, . . . ,d having N — |n| 
real zeros tj, j = in the interval (0,1). The parameters aj^i are 

determined from the block- Vandermonde equations 

Clyt^+^-q) = CUq) 

for all polynomials q{t) of degree < N rii and all z == 1 , . . . , d. 

Provided that g(t) is (m -K l)-differentiable: gi{t) G [0, 1], z = 

1 , . . . , d, one can replace the general moment problem for rri^ ^ by a prob- 
lem, where the measure d/Xi(A), denoted as d/z?(^) is a weight: 

dglit) = ^ \ 



for z = 1 , . . . , d. 

The main problem with construction of moment-preserving spline approx- 
imations on the interval (infinite or finite) to generic functions is its inher- 
ent instability. Corresponding linear problem are notoriously ill-conditioned 
are require nearly quadratic increase with N in precision of computations, 
making it prohibitively difficult even for moderate N. These difficulties are 
significantly less pronounced in the case of periodic splines, or for other simi- 
lar problem with functions defined on the (part of) the unit circle, when the 
knots (zeroes) are lying on the unit circle. 
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3 Trigonometric Moments Problem and Periodic Splines. 

3.1 Caratheodory Theorem. 

A crucial result used in the solution of the trigonometric moment problem is 
known as the Caratheodory theorem: 

Theorem (Caratheodory). For any N complex numbers ci, ... ,cyv 
there exists a unique M < N , positive real numbers pi, ... ,Pm and distinct 
real numbers ... , 6m such that — tt < 6j < tt, j = 1, . . . , M and 

M 

The proof of this Caratheodory theorem is important because the same 
method is used in some of our proofs as well. We use the classical method de- 
scribed in [GS]. First, one extends the sequence no negative indices 

as follows: c-k = for /c = where c* is the complex conjugate of c. 

Then we construct a (A + l) x (A-f 1) ’’zero-diagonal” Toeplitz matrix T such 
that Tij = Cj-i for i ^ j and z, j = 0, . . . , A and Ti,^ = 0 for i = 0, . . . , A. 
Let us denote by Aq the smallest eigenvalue of T, and define (for the first time 
in the context of the Caratheodory theorem) cq as cq = — Aq. Then one can 
defined a” full” {N + 1) x (N + 1) Toeplitz matrix Tm with = Cj-i 

for ij = 0 ,..., A. 

We can define M < N such that N — M \s the multiplicity of the eigenvalue 
Aq. Then M is the rank of T,v- One looks then at the leading (M4-l)x(M-l-l) 
sub-matrix Tm of T^v with {TM)ij = Cj-i for i.j = 0, . . . ,M. Then Tm, as 
well as Tjv has a zero eigenvalue. Let e be the zero eigenvalue of Tm, where 
e = (eo, . . . , Cm)- 

From e one can construct the polynomial Qe(^), defined via elements of 
e, Qe{z) = polynomial Qe{z) of degree M has only simple 

complex roots, and all these roots lie on the unit circle. We denote them as 
e'^^3 for j = 1, . . . , M. The weights pj {j = 1, . . . , M) are defined from the 
Vandermonde system of equations: 

M 

ck = k = 1,... ,N. 

3 = 1 

The weights pj are positive, and Pj = cq. This proves Caratheodory 

theorem. 



3.2 Approximations of Periodic Function by Periodic Splines. 

We start with the one-dimensional case of a real periodic function f{t), defined 
on [0, 27t] (periodic with the period 27t) and having well- defined leading N -]-l 
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(complex) Fourier coefficients cq, ... , ca/^. Because f{t) is real, one extends 
to negative k by defining: C-k = for positive k. 

Since we consider a problem of approximating a periodic function by peri- 
odic splines, we can use instead of f{t) its truncated expansion in the Fourier 
series: 

N 

fN{t) = ^ 

k=-N 

where 

The periodic spline sp{t) on [0, 27 t] can be defined in two different ways, 
depending on whether the end points of the period are included in the list 
of knots. In the simpler definition, the periodic spline sp{t) of degree m on 
[0, 27 t] has M knots, starting with to = 0. These are: 

0 == to < • . • < tM~l < 27T, 

and we can add by periodicity = 27t. The spline has the generic represen- 
tation: 

M-l 

sp{t) = Pm{t) + ^ ai{ti - t)^, 
i=l 

with additional periodicity conditions satisfied: 

sp^^\2tt) = A: = 0, . . . , m — 1. 

This leaves a total of 2M — 1 real variables in sp{t) (M — 1 for variable 
knots ti, i — 1 and total M for free coefficients: ai at — i — 

1, . , . , M — 1 and for a constant term oi pm{t)). This allows us to match a total 
of 2M — 1 real Fourier coefficients of sp{t) with Fourier coefficients of /m-i(A) 
(or f{t)), or Cj for j = 0, . . . , M — 1. The periodicity conditions uniquely define 
all but the constant coefficients of Pm{t) in terms of a^, i — 1, . . . , Af — 1. 

Let us write prn(0 so that the leading coefficient ofpm{t) 

(at is Pm,m’ Then the periodicity conditions uniquely determine all non- 
constant coefficients Pm,k for k = 1, . . . , m in terms of a^, i = 1, . . . , Af — 1, 
from differences of derivatives at f = 27 t and f = 0: 



1 1 ^ 

^ k = 0, . . . ,m — 1. 



The constant coefficient Pm,o of pm{t) is not determined from the periodicity 
condition, and is determined by equating the constant Fourier coefficients of 
sp{t) and /M-i(f) (i-e., cq). 

After m-th order differentiation we can reduce the problem of fitting m-th 
degree spline to the problem of fitting piecewise-constant function (spline of 
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degree 0) with M jump points to the Fourier expansion of In the 

notations above, {U — t)^ is a unit step- function with a jump ai t = ti. 

Using integration in parts we get for any 27r-periodic function h{t), 

p2tt p2tv 

/ = —{in) / h{t)e^'^^ dt. 

Jo Jo 

This means that in order to determine m-th degree spline sp{t) of the form 
above (or more specifically, its free parameters: ti^i = — 1, coefficients 

ai at {ti—t)'^ for i = 1, . . . , M — 1 and a constant term Pm,o)) we need to solve 
the problem of matching the M — 1 Fourier coefficients c^, — 1 

of/lT-iW: 

M-1 

/iT-iW = E iikrcke^’^^ 

k=-{M-l) 

4 = ■ Ck 

with M — I Fourier coefficients of the piecewise-constant function s{t) on 
[0, 27 t] with M jumps at U. Assuming, as above, that to = 0 and Im = 27t, 
we define s{i) on [0, 2tt) as follows: 

s{t) = Pi for t G [ti,ti+i), i = 0, ... ,M - 1, 

and extend s{t) everywhere on the real axis by 27r-periodicity. The relationship 
between pi in the definition of s{t) and coefficients (i = 1, . . . , M — 1) and 
the leading coefficient Pm,m (of Pm{t)) in the definition of sp{t) is the following 
one: 

M-l 

(3i = + {-O'" X] a*) for i = 0, 1. 

We also define (by periodicity) Pm = Po- 

It is straightforward to compute the Fourier coefficients of the (peri- 
odic) piecewise-constant function s{t) on [0,27 t]. For this let us denote Zj = 
g-itj ^ j _ 0^ . . . ^ Af , so that zq = zm = 1 as assumed above. Then 

1 p2tt -j -j m-1 

i=0 

for n ^ 0. Changing the summation order, remembering that zq = zm^ Po = 
Pm, and adding a new notation p-\ = Pm-i, we get: 



1 

2tt 




s{t)e~^^Ut = — — 
27 t in 



M-l 

E 



i=0 



n 7 ^ 0 



= Pi - Pi-1, i = 0, ... ,M- 1. 
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In terms of a^, the definition of vi is straightforward: 

= (—1)"^"^^ m\ai ioT i = 1, . . . , M — 1, 



M-l 

VO = (-irm! Y^au. 

k=\ 



This leads to the complete system of equations on tj, a^, i = 1, ... , M — 1 
in terms of the Fourier coefficients Ck of f{t): 



M-l 



27T • • Cn = ^ ^ ^ 1, . . . , M - 1, 

J=0 



M-l 



I] = 0 



J =0 



M-l 



j=l k=Q 

These equations are supplemented by: 

Vi = (—1)"^“^^ m! Ui for 2 = 1, . . . , M - 1, 



fc+i 



M-l 



1 

A: = 0,...,m- 1. 

i=l 



kVdt^^ 



3.3 Solution of Periodic Spline Problem. 

The system of equations above provides the solution to the problem of spline 
approximation using matching of trigonometric moments. As the equations 
show, the problem is entirely reduced to the solution of the following main 
trigonometric problem: 

M-l 

Cn = J/j • 2:", n = 1, . . . , M - 1, 

j=0 



M-l 

= 0 

3=0 

Here Cn are properly modified Fourier coefficients: 

C„ = 27t • ■ Cn n^Q, 

and Zj are lying on the unit circle: Zj = j = 0, ...,M — 1, so that 

zo = 1- 
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There are two slightly different solutions to this problem. One is based on 
the standard approach using Fade approximation; this time it is a two- point 
Fade approximation at 2 ; = 0 and z = 00 . While straightforward, it leaves open 
the usual questions of normality and rigorous proof of the location of zeroes 
of Fade approximants on the unit circle. An alternative approach is based 
on the Caratheodory theorem, and provides the necessary simple criteria of 
normality and the proof of zeroes’ location on the unit circle. We present both 
solutions that compliment each other. 

To arrive at the Fade approximation representation, we introduce the poly- 
nomial Q(z) of degree M: 



Q{z) 



M-l 



n 

j=0 



and a rational function R{z) = 
of two polynomials: 

R(z) = 



— which can be written as a ratio 

Z-ZJ ’ 

Pjz) 

Q{z)' 



Because of the equations on j/j, deg^{P) < M — 2 (or R{z) —> 0{z~^) as 
z ^ oo). Expanding i?(z) at z = 0 and z = oo, and using the fact that 
Zj lie on the unit circle (zj == z“^) and are real, we can see that R{z) 
approximates two series expansions associated with Fourier coefficients C±j, 
j = 1, . . . , M — 1. The two expansions are: 



M-l M-l 

f+{z) - y; Ckz-‘^-\ f-{z) = - 

k=\ k=\ 

at z = oo and z = 0, respectively. Everywhere above and below we set Co = 0 
and C-k = for positive k. 

The expansions of R{z) at z = 0 and z = oo are 



R(z) = ^ 



-k — 



k=0 



M-l 

j=0 



at z 



CXD, 



oo M-l 

R{z) = -53^''- E pS+i atz->0. 

k=0 j=0 

Thus the approximations properties of R{z), equivalent to the original equa- 
tions, are the following: 

= f^{z) -h 0{z~^~^) at z — > oo, 

^{z) = f~{z) + 0{z^~^) at z 0. 

These approximation properties can be replaced by 
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P{z) = Q{z) • f~^{z) + 0{z at z -> cxD, 

P{z) = Q(z) • f~{z) + 0{z^~^) a.t z 0. 

The two approximations above can be written as a system of linear equations 
on coefficients of the polynomial Q{z). If Q{z) is expanded in powers of 2 : as: 

M 

j=o 

then we get the system of linear equations on Qj, j = 0, . . . , M: 

M 

■ Cj-k-i = 0, k = 0, ... ,M-2. 

j=Q 

In the equations above we set Co 0 and C-k = for positive k. 

The system of equations on Qj above (M — 1 of them) is supplemented by 
the following one: 

M 

^ Qj ~ 

equivalent to the statement that Q{1) = 0 or zq = 1. In the case of nor- 
mality of two-point Fade approximation, the system of M equations on Qj 
uniquely determines the polynomial Q{z) up to the (normalization of) the 
leading coefficient Qm; so one can add M -h 1-st equation 

Qm ~ 1- 

In the generic case the two-point Fade approximations are normal and 
polynomials Q(z) have all roots distinct and lying on the unit circle. How- 
ever, this approach based on Fade approximations alone (or solution of the 
system of Toeplitz equations) does not always guarantee the solution of the 
problem of finding the periodic spline with proper matching trigonometric 
moments. Here the situation is similar to that of splines matching moments 
on the interval, where sometimes the orthogonal polynomials have zeroes out- 
side of the interval that supports the measure. The problem of matching the 
trigonometric moments is different - it always has solution. Moreover, this 
solution (as a function of coefficients Ck) is stable. 

The explicit solution to the problem of finding the periodic spline with 
proper matching trigonometric moments can be derived from the Caratheodory 
theorem (or, more precisely, from the method used in the proof of the 
Caratheodory theorem). In this approach we start with the system of equa- 
tions, as in the proof of Caratheodory theorem: 

M-l 

Cn = ■ Z^, n = 0, ... ,M -1, 

j=l 
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Here C^, n — — 1 are variables, while Cq is dependent on n = 

1 , . . . , M — 1 and is defined from the eigenvalue equation: 

|Ti + Cl ■Im\= 0, 

for 

= C]_i, z,i = 

= 0. 

(More precisely, Cq is the largest real number satisfying this eigenvalue equa- 
tion.) 

We add = 1 (for = 0) to the list of roots {zk}^S^ on the unit 
circle with a variable z^o as its weight. Thus we set -h uuq = Cn for all 
n = 1, ... ,M — 1. In these notations the system of nonlinear equations 

M-l 

C'n = Vj -Z^, n = 1, ■■■ ,M- 1, 

j=o 

M-l 

y = 0 

can be represented, according to the (proof of) Caratheodory theorem as a 
determinantal identity: 

— Vq C\ — Uq . . . Cm-1 — 

C-i — l/Q —Uq . . . Cm-2 — ^0 

. = 0- 

C-M+2 — • • • —^0 

This equation can be written in the form 

|T — z/Q • Um| = 0, 

where, as above, (T)ij = Cj-i, i^j = 0, . . . , M — 1, for i ^ j and (T)i,j = 0, 
and Um is ’’all ones” matrix: = 1? = 0, . . . , M— 1. Consequently, 

we get a simple linear equation, defining z/q: 

|T| - z/o • |T'| = 0, 

where T' is an algebraic compliment of (elements of) T. 

Once z/Q is determined from this equation, all other Uj and Zj = for 
j = 1, . . . , M — 1 are determined from the Caratheodory theorem (as above) 
with C] = Cj - z/Q for j = 1, . . . , M - 1. 

This method is stable and also has the advantage that it determines the 
minimal number M of knots U with to = 0 of the periodic spline sp{t) of 
degree m that matches N Fourier coefficients Ci of /(t) (for M < N). 
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There are, however, some cases when |T'| = 0 and in those cases one 
cannot get any periodic spline sp{t) of degree m that matches M Fourier 
coefficients C{ of f{t) and has M knots on [0, 27t] with to = 0. It turns out 
that there is still a stable solution in this case as well; it just requires M knots 
to be in a general position in [0, 27 t]. 

Here we will use a second definition of a periodic spline sp 2 {t) on [ 0 , 27 t] 
that does not fix a knot at to = 0. In the second, a more complex, definition, 
the periodic spline sp 2 {t) of degree m on [0, 27 t] has M knots: 

0 < to < . . . < tM-l < 27T, 

and we can add by periodicity Im = to. The spline has the form: 

M-l 

sp 2 {t) = Pm(t) + ^ ai{U - 
i=0 

satisfying very strong periodicity conditions (of C’^-smoothness): 

sp^P{2'k) = fc = 0 , ... , m . 

This leaves total of 2M real variables in sp 2 {t): M for variable knots U, i = 
— 1,M — 1 for coefficients: ai at (t — t^)"^, i = 0, . . . , M — 1 and for 
a constant term of Pm{t). The single relation between ai is the following one: 

«0 + — ' + 1 0* 

This allows us to match 2M — 1 real Fourier coefficients of sp{t) with Fourier 
coefficients of /m-i( 0 (or f{t))^ or Cj for j = 0, ... ,M — 1, and still leaves 
one extra degree of freedom. 

This extra degree of freedom is very easy to observe. Since f{t) is 2tt - 
periodic, so is f{t — to) for any (real) to, and Fourier coefficients of f(t — to) are 
related to Fourier coefficients of f{t) by multiplicative factors of . Then 
we apply essentially the ’’simple periodic splines” methods, described above, 
to f(t — to) but now with M knots ti ^ U — to, i = 0, . . . , M — 1 (so one of 
the knots is a fixed one at t = 0). As long as the corresponding T' = T'(to) 
is non-singular (and that is the generic case), one has an extra parameter 
to to fit. It can be used, for example, to get a better L 2 -approximation by 
a periodic spline sp 2 (t) of degree m with M knots of /(t) that matches M 
Fourier coefficients of /(t). It also can be used to match one extra real or 
imaginary part of the Fourier coefficient cm • 

There are a variety of algorithms that can be used for the computations 
of the periodic spline approximations sp{t) and sp 2 {t). For a fixed degree m 
the basic algorithm is reduced to operations with Toeplitz matrices (or the 
construction of two-point Fade approximations). The standard (slow) method 
of computation are of complexity O(M^). These methods are either based on 
direct solution of Toeplitz equations or the use of the recurrence relations con- 
necting Fade approximants Q{z) for increasing M. The recurrence relations 
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follow from the classical three-term recurrence relations connecting polyno- 
mials orthogonal on the unit circle, see [S]. Those polynomials have the form: 



Co 


C-i 


. . . 


Cl 


Co 


. . . 


Cn-l 


Cn-2 




1 


z 


. . . z'^ 



While all zeroes of Pn{z) lie within the unit circle \z\ < 1, one constructs the 
” para-orthogonal” polynomials [J89] that have their zeroes on the unit circle 

kl = 1: 

Pn{z) W • pn{z) for \w\ = 1, 

where p'n(z) = z^ • {pn)*{l/z). Two-point Fade approximants Q{z) and poly- 
nomials Qe(^) from the proof of Caratheodory theorem are examples of para- 
orthogonal polynomials, see [J89]. 

The recurrence relations arising from polynomials orthogonal on the unit 
circle provide a recursive approach to the construction of optimal periodic 
splines sp{t) and spo{t). In this approach one can increase the number M 
of knots in a way such that previous knots ’s locations interleave the new 
ones. The recursive approach based on Fade approximants also provides a 
fast method of computation of splines sp{t) for a fixed f{t) and m but in- 
creasing M. This recursive approach based on a conventional ’’divide and 
conquer” polynomial interpolation with fast polynomial multiplication and 
division results in O (M log^(M)) complexity. 

3.4 A Very Simple Example. 

We present a very simple example of a single harmonics case: 

f{t) = cos{t), 

or Ck = 0 unless /c = ± 1, where c±i = In this case one can very easily 
determine analytically the knots tj and weights a^, and the quality of the 
spline approximation sp{t)^ as described in the previous section, to f{t) in 
various norms. There are 3 cases of different parity of m and M. 

The first, typical, case is of even m and odd M. We put M = 2n — 1, and 
the set of M knots tj, j = 0, . . . , M — 1 which is: 

{0} U : j = 1} U : j = n + 1, . . . , 2n - 1} 

(with ^0 =0). The corresponding i/j are: 

i^j = (-l)”"/2+i . ^ . sin(tj) for j = 1, . . . , M 
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(m is even). 

The second case is that of odd m and even M. We put M — 2n — 2 for 
even n. (The same solution also covers the case M = 2n—l for even n.) Here 
the set of M knots tj, j — 0, . . . , M — 1: 

77T Tl 

{0, 7t} U { — : j — 1, . . . , 2n — 1 and j not divisible by — }, 
n 2 

is an integer). The corresponding z/j are: 

i/j = (— . — . cos{tj) for j = 0, . . . , M 
n 



(m is odd). 

The third (and last) case is that of odd m and odd M (mod 4) == 1. We 
put M = 2n — 1 for odd n. Here the set of M knots tj^ j = 0, . . . , M — 1: 

{0, 7t} U : J == 1, . . . , 2n — 1 for odd j andj ^ n, 3n}, 

The corresponding i^j are: 

i/j = (_i)('«+i)/2 . 1 . cos(tj) for j = 1, . . . , M 

(m is odd) and z/q ~ 0 corresponding to to = 0. 

In all these cases, tj and Vj uniquely determine the spline approximation 
sp{t) (from formulas presented above for a^, j — 1, . . . , M — 1 and pm(t)). 

The standard measure of the average quality approximation of sp{t) to 
/(t) for an arbitrary /(t) is based on L 2 -norm: 

This quantity can be determined analytically using the Parceval identity and 
the Fourier expansion of f(t) — sp{t) that is easily computable in terms of 
trigonometric sums: 

M-l 

k = M,... 

j=o 

and Fourier coefficients of f{t). In turn, trigonometric sums are computed 
using P{z)^Q{z) polynomials from two-point Fade approximation to 
We define 



G{m,n) = 



(2m + 1)! ■ (2 n )2™+2 



for a polygamma function 'ip[z). This can be simplified to: 
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G(m, n) = —1 — 



^2m+l 

^ dz^m+i (cotinz)) 



z= 



1 

2n 



1 

(2m+ l)!(2n)2"^+2* 



When f{t) = cos(t) in all three cases above we get the following expression 
for e as a function of m and n: 



e = Y^G(m,n). 

As above, in the case 1 we have m even and M = 2n - 1; in the case 2 we 
have m odd and M = 2n — 2 for even n; in the case 3 we have m odd and 
M = 2n — 1 for odd n. 

Looking at log 2 e, we can see that log 2 e < -24 for m = 3, n > 33; m = 
4,n > 15;m = 5,n > 9; m = 6,n > 6;m = 7,n > 5;m > 8,n > 4; m > 
10,n > 3;m > 14; n > 2. 

We include several plots of f(t) = cos(t) and of the corresponding spline 
approximation sp(t) in Figs. 1-4. Fig. 5 shows the plot of the figure of merit 
log 2 e as a function of m, n. 




100 

Fig. 5. log 2 e(m, n) as a function of m, n. 



3.5 Multidimensional Closed Curves and Periodic Splines. 

The case of d-dimensional closed curves f(^) = (/i(^), . . . , /^(t)) approxi- 
mated by a periodic spline curve sp(^) = . . . , spd{t)) is similar to the 
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case of cZ- dimensional curves approximated by d-dimensional splines on the 
interval that matches the maximal number of moments. However, here we are 
using the two-point generalized Fade approximants (at z = 0 and 2 ; == 00 ) of 
the Second Kind. This problem also leads to an interesting generalization of 
the Caratheodory theorem where one expresses d sequences {c^} of Fourier 
coefficients by trigonometric exponentials 

M 

.^indj for a = 

J = 1 

with M harmonics , j = 1, . . . , M common to all d sequences (but differ- 
ent weights ly^). Just as Caratheodory theorem is used for a spectral estima- 
tion of a time series, the multidimensional problem is used for a joint spectral 
estimation of multiple time series (e.g., multi-channel data collected from the 
same source at the same time). 

In this problem, as above, we look at the d-dimensional curves f(t) = 
(/i(t ), . . . , fd{t)) that is 27T-periodic and is defined by its Fourier coefficients 
with the usual reality condition for. a = 1, . . . , d. We are trying 

to find a d-dimensional spline of degree m: sp{t) — (spi(t), . . . , spd{t))^ de- 
fined on [0, 27 t] and 27r-periodic that has M knots on [0, 27 t] (common to all 
individual spline functions spa{t)): 

0 to < ... <C tM—l ^ 27T, 

and matching the maximum number of Fourier coefficients c" of foc{t)^ a = 
1, . . . , d. The form of individual spline functions is the same as above: 

M-l 
2 = 0 

The problem is reduced to the generalized two-point Fade approximation 
problem of the Second Kind (i.e., approximants with the common denominator 
Q{z)) for functions 

M-l 

= E /« (^) 

fc=i 

at 2 : = 00 and 2 : = 0, respectively. 

The solution is reduced to linear algebra problems and to the following set 
of Toeplitz-like equations on the Fourier coefficients C^: 

M 

= 0, / = 

2=0 

for a = 1, . . . , d and Ea=o(^“ + 1) > M. The knots tj are determined from 
roots 2 : = Zj of the associated polynomial Q{z) = '^fLo 
unit circle by identification Zj = , j = 1, . . . , M . 



M-l 






C%z 



k-1 



k=l 
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Interactions between number theory and 
operator algebras in the study of the Riemann 
zeta function (d’apres Bost— Connes and 

Connes) 

Paula B. Cohen 

August 8, 2000 



Abstract 

This paper arose as a written version of a lecture given at the City University of 
New York Number Theory Seminar in April 1997. The lecture was an overview of ideas 
of Bost-Connes and of Connes on possible ways to approach the study of the Riemann 
zeta function using ideas inspired by noncommutative geometry. The work of Connes is 
directly aimed at a solution of the Generalised Riemann Hypothesis and has undergone 
substantial improvement in the intervening time. In this article we give an overview of 
the original work and these later developments. 



1 Introduction 

In 1859 Riemann published an important foundation 2 d paper on the Riemeinn zeta function. 
Recall that this function is given for Re(s) > 1 by 

n=l 

and that it has a continuation to all the complex plane which is analytic except for a simple 
pole at s = 1. It is straightforward to show that the Riemann zeta function has zeros at the 
negative even integers and these are called the trivial zeros of the Riemann zeta function. 
The Riemann Hypothesis predicts that the remaining zeros lie on the line Re(s) = 1/2. One 
knows that the non-trivial zeros of C(s) lie in the band Re(s) G]0, 1[. The generalisation of 
this function for a number field is known as the Dedekind zeta function. It encodes much 
important arithmetical information about the field. One of the major motivations of number 
theory is to understand more fully the Dedekind zeta function, the most famous challenge 
being to understand the locus of the zeros of this function and in particular to settle the 
validity of the Generalised Riemann Hypothesis for these functions. Much powerful work 
has been done in ansdytic number theory in the attempt to solve the Generalised Riemamn 
Hypothesis directly. 

As the study of the zeros of the Riemann zeta function and its generalisations is so 
difficult, one may ask how it is possible to recast the problem. For example, Polya and 
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Hilbert proposed that if one can construct a Hilbert space % and an operator D inli whose 
spectrum comprises the zeros of the Riemann zeta function in the band Re(s) €]0, 1[, then 
possibly one can settle whether or not — 1/2) is self-adjoint or whether JP(1 — D) is 

positive, which would imply the Riemann hypothesis. The point here is that the properties 
of self-adjointness or of positivity are hopefully easier to check. It is important that the 
construction of the Hilbert space and the operator should not depend a priori on the zeta 
function, to avoid tautologies. An as yet non-rigorous approach to the Riemann hypothesis 
initiated by Connes in [5] and further developed in [6], [7], includes an interesting and rigorous 
interpretation of the zeros of the L-functions with Grossencharacter of a global field in terms 
of the action of the idele class group on the coset space of the adeles modulo the principal 
ideles. 

This work of Connes derives in particular from his work with Host [2] which was m 
turn inspired by ideas of Julia [11] and others. The aim is to enrich our knowledge of the 
Riemann zeta function by creating a dictionary between its properties and phenomena in 
statistical mechanics. The starting point of these approaches is the observation that, just 
as the zeta functions encode arithmetic information, the partition functions of quantum 
statistical mechanical systems encode their large-scale thermodynamical properties. The 
first step therefore is to construct a quantum dynamical system with partition function the 
Riemann zeta function, or the Dedekind zeta function in the general number field case. In 
order for the quantum dynamical system to refiect the arithmetic of the primes, it must 
capture also some sort of interaction between them. This last feature translates in the 
statistical mechanical language into the phenomenon of spontaneous symmetry breaking at 
a critical temperature with respect to a natural symmetry group. In the region of high 
temperature, there is a unique equilibrium state as the system is in disorder and symmetric 
with respect to the action of the symmetry group. In the region of low temperature, a phase 
transition occurs and the symmetry is broken. This symmetry group acts transitively on a 
family of possible extremal equilibrium states. The construction of a quantum dynamical 
system with partition function the Riemann zeta function C(/3) and spontaneous symmetry 
breaking or phase transition at its pole /3 = I with respect to a natural symmetry group 
was achieved by Host and Connes in [2]. A different construction of the basic algebra using 
crossed products was proposed by Laca and Raeburn and extended to the number field case 
by them with Arledge in [1]. An extension of the work of Host and Connes to general global 
fields was done by Harari and Leichtnam in [10]. The generalisation proposed by Harari and 
Leichtnam in [10] fails to capture the Dedekind zeta function as partition function in the 
case of a number field with class number greater than I. Their partition function in that case 
is the Dedekind zeta function with a finite number of non-canonically chosen Euler factors 
removed. This prompted the author’s paper [3] where the full Dedekind zeta function is 
recovered as partition function. This is achieved by recasting the original construction of 
Host and Connes more completely in terms of adeles and ideles. The symmetry group of 
the system constructed by Host and Connes is a Galois group, in fact the Galois group over 
the rational number field of its maximal abelian extension. Using the Artin isomorphism, 
which says that this symmetry group is also the unit group of the finite ideles. Host and 
Connes recover the actual Galois action on the elements of the maximal abelian extension 
via its action on the equilibrium states of the system. In the general number field case, 
the symmetry group is again the unit group of the finite ideles, but this group does not in 
general have a Galois interpretation. See [10] for a discussion of this point. 

In [7], Connes outlines a tantilising analogy between the Galois correspondence in number 
theory and the classification of factors in the theory of von-Neumann algebras. This is in 
the spirit of the idea of Andre Weil that the key to understanding the Riemann hypothesis 
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may well lie in a decent Galois interpretation of the id^le class group of a number field. 
The difficulty here lies in the intervention of the archimedean valuations and lies beyond, 
for example, what is known about the case of function fields of varieties over finite fields. 
Connes maintains that aspects of the theory of the classification of factors represents a type 
of Galois theory adapted to the real and complex numbers. 

In this paper we provide an introduction both to the work of Bost-Connes and to the 
work of Connes on the Riemann hypothesis. Our purpose is to assist the interested reader 
to understand these works. We restrict our attention to the rational numbers and to the 
Riemann zeta function. It should be noted that a solution of the Riemann hypothesis may 
well necessarily involve studying the zeta function 2 is part of a family of zeta functions in 
the spirit of [12]. 

Acknowledgements: The author thanks the Ellentuck Fund and the School of Mathematics 
of the Institute for Advanced Study, Princeton, for their support during the preparation of 
this paper. 

2 The problem studied by Bost— Connes 

Before stating the problem solved by Bost and Connes in [2] and its analogue for number 
fields, we recall a few basic notions from the (7*-algebraic formulation of quantum statistical 
mechanics. For the background, see [4j. Recall that a C'*-algebra B is an algebra over the 
complex numbers C with an adjoint x ^ x*, x € Bj that is, an anti-linear map with x** = x, 
(xy)* = y*x*, x,y 6 B, and a norm || . || with respect to which B is complete and addition 
and multiplication are continuous operations. One requires in addition that ||xx*|| = ||x|p 
for all X G B. All our C* -algebras will be assumed unital. The most basic example of a 
noncommutative C*-algebra is B = Mn{C) for AT > 2 an integer. The G*-algebra plays the 
role of the “space” on which the system evolves, the evolution itself being described by a 
l-p 2 u:ameter group of G* -automorphisms or : R Aut(B). The quantum dynamical system 
is therefore the pair (B, at). It is customary to use the inverse temperature P = 1/kT rather 
than the temperature T, where k is Boltzmann’s constant. Then, one has the definition of 
Kubo-Martin-Schwinger (KMS) of an equilibrium state at inverse temperature p. Recall 
that a state on a C7* -algebra B is a positive linear functional on B satisfying y?(l) = 1. It 
is the generalisation of a probability distribution. 

Definition 1 Let (B^crt) be a dynamical system, and (p a state on B. Then (p is an equilib- 
rium state at inverse temperature P, or KMSp-state, if for each x,y £ B there is a function 
Fx,y(z), bounded and holomorphic in the band 0 < Im(z) < p and continuous on its closure, 
such that for o// 1 6 R, 



Fx,y{t) = ^(x(Tt{y)), Fx,y{t -f V^P) = (1) 

In the case where B = M//(C), every 1-parameter group at of automorphisms of B C£in be 
written in the form, 



at{x) = e**^xe ^ x G B, f G R, 

for a self-adjoint matrix H = H*. Then for B > 0 and for all /9 > 0, there is a unique 
KMSp equilibrium state for (B,at) given by 

(l>p{x) = Trace(xe-^")/Trace(e-^"), x G Mn{C). (2) 
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This has the familiau: form of a Gibbs state and is easily seen to satisfy the KMS/j condition 
of 1. The KMS/j states can therefore be seen as generalisations of Gibbs states. The 
normalisation constant lVace(e“^^) is known as the partition function of the system. A 
symmetry group G of the dynamical system {By at) is a subgroup of Aut(B) commuting 
with a: 

goat-atOQy p € G, f G E. 

Consider now a system {By at) with interaction. Then, guided by quantum statistical me- 
chanics, we expect to see the following features. When the temperature is high, so that ^ 
is small, the system is in disorder, there is no interaction between its constituents and the 
state of the system does not see the action of the symmetry group G\ the KMS/j-state is 
unique. As the temperature is lowered, the constituents of the system begin to interact. 
At a critical temperature /?o a phase transition occurs and the symmetry is broken. The 
symmetry group G then permutes transitively a family of extremal KMS/ 3 - states generat- 
ing the possible states of the system after phase transition: the KMS; 3 -state is no longer 
unique. This ph 2 use transition phenomenon is known as spontaneous symmetry breaiking at 
the critical inverse temperature Pq. The partition function should have a pole at Pq. For a 
fuller explanation, see [2]. The problem solved by Bost and Connes was the following. 

Problem 1 : Construct a dynamical system {By at) with partition function the zeta function 
C{P) of Riemann, where P > 0 is the inverse temperature, having spontaneous symmetry 
breaking at the pole P = 1 of the zeta function with respect to a natural symmetry group. 

As mentioned in the introduction, the symmetry group is the unit group of the ideles, 
given by W = Hp where the product is over the primes p and Z* = {up e ^ : \up\p = 1}. 
We use here the normalisation |p|p = This is the same as the Galois group Gal(Q^^/Q). 
Here is the maximal abelian extension of the rational number field Q, which in turn is 
isomorphic to its meiximal cyclotomic extension, that is the extension obtained by adjoining 
to Q all the roots of unity. The interaction detected in the phase transition comes about 
from the interaction between the primes coming from considering at once all the embeddings 
of the non-zero rational numbers ((J* into the completions Qp of Q with respect to the prime 
valuations |.|p. The natural generalisation of this problem to the number field case was 
solved in [3] and is the following. 

Problem 2 : Given a number field K, construct a dynamical system {B,at) with partition 
function the Dedekind zeta function Ck{P), where P > 0 is the inverse temperature, having 
spontaneous symmetry breaking at the pole /3 = 1 0 / the Dedekind function with respect to a 
natural symmetry group. 

Recall that the Dedekind zeta function is given by 

ReW>l- (3) 

Here O is the ring of integers of K and the summation is over the ideals C of K contained in 

0. The symmetry group is the unit group of the finite ideles of K. The main new difficulty 
that arose in the number field case was to generalise to the case of class number greater than 

1 . One has to use the fact that the principal prime ideals already provide enough interaction 
to engender a phenomenon of spontaneous symmetry breaking. 

For the natural generalisation to the function field case see [10]. For the sake of expo- 
sition, we restrict ourselves in the sequel to the case of the rational numbers, that is to a 
discussion of Problem 1 . 
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3 Construction of the C*-algebra 

We give a different construction of the C* -algebra of [2] to that found in their original paper. 
It is essentially equivalent to the construction of [1], except that we work with adeles and 
ideles. In the gener 2 disation to the number field caise, this makes quite a difference. Let A 
denote the finite adeles of Q, that is the restricted product of with respect to Zp. Recall 
that this restricted product consists of the infinite vectors (flp)p, indexed by the primes p, 
such that Up 6 Qp with Op 6 Zp for almost all primes p. The (finite) adeles form a ring 
under componentwise addition and multiplication. The (finite) ideles J are the invertible 
elements of the adeles. They form a group under componentwise multiplication. Let ZJ be 
those elements of Up 6 Zp with |up|p = 1. Notice that an idele (up)p has Up € <1^ with 
Up G ZJ for almost all primes p. Let 

7i = l][Zp, i = jr\n, w = J[z;. 

P P 

Further, let I denoted the semigroup of integral ideals of Z. It is the semigroup of Z-modules 
of the form mZ where m € Z. Notice that I as above is also a semigroup. We have a natural 
short exact sequence, 



1 W I 1. (4) 

The map J I in this short exact sequence is given as follows. To (up)p € I associate the 
ideal where ordp(up) is determined by the formula |up|p = p-^^^pM. It is clear 

that this map is surjective with kernel W, that is that the above sequence is indeed short 
exact. By the Strong Approximation Theorem we have 

Q/Z-.4/7e-0pQp/Zp (5) 

and we have therefore a natural action of I on Q/Z by multiplication in A/TZ and transport 
of structure. We use here that I7Z C IZ. Mostly we shall work in A/1Z rather than Q/Z. 
We have the following straightforward Lemma (see [3]). 

Lemma 1 For a = (op)p G I and y G A/1Z, the equation 



ax = y 



has n(a) =: solutions in x G A/IZ. Denote these solutions by [a; : ax = i/]. 

In the above lemma it is important to bear in mind that we are computing modulo IZ. 
Now, let C[A/1Z] =: span{<5x : x G A/TZ] be the group algebra of A/IZ over C, so that 
= Sx^x' for x,x' G A/7Z. We have (see for comparison [1]), 



Lemma 2 The formula 






x:ax—y] 



for a e I defines an action of I by endomorphisms of C*(A/'JZ), 



The endomorphism Oa for a G / is a one-sided inverse of the map S X ^<xx for X G A/1Zj so 

it is like a semigroup “division” . The C* -algebra can be thought of as the operator norm 
closure of C[A/TZ] in its natural left regular representation in 1^{A/TZ). We now appeal to 
the notion of semigroup crossed product developed by Laca and Raeburn and used in [1], 
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applying it to our situation. A covariant representation of (C'*(v4/72-),7, a) is a pair (tt, F) 

wViPrp 

7T : C*(Ain) B(U) 

is a unital representation and 

V :I-^ Bin) 

is an isometric representation in the bounded operators in a Hilbert space %. The pair 
(tt, V) is required to satisfy, 

Ac‘M)) = VMf)v:, aei, feC^iA/iz). 

Notice that the Va are not in general unitary. Such a representation is given by (A, L) on 
with orthonormal basis {cx : x € A/7Z} where A is the left regular representation 
oi C*{A/n) on l^{A/n) and 



— 



1 









The universal covariant representation, through which all other covariant representations 
factor, is called the (semigroup) crossed product C*{AI7Z) x^/. This algebra is the universal 
C*-algebra generated by the symbols {e{x) : x € A/H] and {fXa : a G /} subject to the 
relations 



= 1, l^at^b = l^ab, a,b€l, (6) 

e(0) = 1, e(x)* = e(-x), e(x)e(y) = e(x + y), x,y e A/H, (7) 

-T1 Z ^(.^) = t^ae(y)K> ael,y e A/n. (8) 

The relations in (6) reflect a multiplicative structure, those in (7) an additive structure and 
those in (8) how these multiplicative and additive structures are related via the crossed 
product action. Julia [11] observed that by using only the multiplicative structure of the 
integers one cannot hope to capture an interaction between the different primes. When 
It € W then is a unitary, so that = 1 and we have for all x G A/lt^ 

fXue{x)fj,l = e(u~^x), fxle{x)nu = e{ux). (9) 

Therefore we have a natural action of W as inner automorphisms of C*{A/1Z) / using 

(9). 

To recover the C*-algebra of [2] we must split the short exact sequence (4). The ideals 
in I are all of the form mZ for some m G Z. This generator m is determined up to sign. 
Consider the image of lm| in I under the diagonal embedding q (q)p of Q* into /, where 
the p-th component of {q)p is the image of g in under the natural embedding of Q* in 
. The map 



+ : mZ »-> (|m|)p (10) 

defines a splitting of (4). Let 1+ denote the image and define B to be the semigroup 
crossed product C*(A/7Z) x« with the restricted action a from / to By transport 
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of structure using (5), this algebra is easily seen to be isomorphic to a semigroup crossed 
product of C*(Q/Z) by M^., where N+ denotes the positive natural numbers. This is the 
algebra constructed in [2] (see also [1]). ^FVom now on, we use the symbols {e(x) : x e Q/Z} 
and {fia : a € N+}- It is essential to split the short exact sequence in this way in order to 
obtain the symmetry breaking phenomenon. In particular, this replacement of I by now 
means that the group W acts by outer automorphisms. For x € B, one has that /ijx/iu is 
still in B (computing in the larger algebra C*{A/Tl) Xa I), but now this defines an outer 
action of W. This coincides with the definition of PV” as the symmetry group as in [2]. 

4 The Theorem of Dost— Connes 

Using the abstract description of the C*-algebra B of §3, to define the time evolution a of 
our dynamical system (B,a) it suffices to define it on the symbols {e(x) : x 6 Q/Z} and 
{/ia ’• CL ^ }• For t G R, let cTt be the automorphism of B defined by 

- m*\ meN+, (7t(e(x)) = e(x), x € Q/Z. (11) 

By (6) and (9) we clearly have that the action of W commutes with this 1-parameter group 
at. Hence W will permute the extremal KMS/j-states of (B^at). To describe the KMS/j- 
states for /? > 1, we shall represent (B^at) on a Hilbert space. Namely, following [2], let 
y, be the Hilbert space /^(N^.) with canonical orthonormal basis {£m,Tn G Rf.}. For each 
u G IF, one has a representation tTu of B in B{'H) given by, 

= ^mn, m, n G N+ 

7Tu(e(x))s„ = exp(2i7rnu o x)en, n G N+, x G Q/Z. (12) 

Here u o a; for u G IF and x G Q/Z is the multiplication induced by transport of structure 
using (5). One verifies easily that (12) does indeed give a C*-algebra representation of B. 
Let H be the unbounded operator in H whose action on the canonical basis is given by 

Hen = (logn)en, n G N-|_. (13) 

Then clearly, for each u G IF, we have 

= e*^^7Tu(x)e“*^^, t G E, X G H. 

Notice that, for /3 > 1, 

’IVace(e"^") = = ^n~^, 

n=l n=l n=l 

SO that the Riemann zeta function appears as a partition function of Gibbs state type. We 
can now state the main result of [2]. 

Theorem 1 (Bost-Connes) The dynamical system {B^at) has symmetry group W. The 
action of u £ W is given by [u] G Aut(B) where 

[u] : e{y) i-+ e(u oy), ye Q/Z, [u] : /Ja Mo, o e N. 
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This action commutes with a, 

[u] oGt — Gto [u], u ew^ t eR. 

Moreover, (1) for 0 < ^0 < 1, there is a unique KMS/j state. (It is a factor state of Type IIIi 
with associated factor the Araki- Woods factor RooJ (2) for P> 1 and u , the state 

<t>0,u{x) = C(/5)~^Trace(7rtx(x)e"^^), x e B 

is a KMS /3 state for (It is a factor state of Type loo)- The action ofW on B induces 

an action on these KMS/j states which permutes them transitively and the map u i-> <t>p^u 
is a homomorphism of the compact group W onto the space Sp of extremal points of the 
simplex ofKMSp states for (B,Gt). (3) the C function of Riemann is the partition function 
of{B,at). 



Part (1) of the above theorem is difficult and the reader is referred to [2] for complete details, 
as for a full proof of (2). That for > 1 the KMS^g-states given in part (2) fulfil 1 of §2 is a 
straightforward exercise. Notice that they have the form of Gibbs equilibrium states. 

5 The space and group action in Connes’ approach to 
the Riemann hypothesis 

Theorem 1 solves Problem 1 of §2. More information is contained in its proof however. As 
mentioned in the §1, given the existence of the Artin isomorphism in class field theory for 
the rationals, one can recover the Galois action of W explicitly. It is still an open problem to 
exhibit this Galois action in terms of an analogue of {B, at) in a satisfactory way for general 
number fields. Indeed, in [3] the analogue of W is again an infinite product over unit groups 
in the integers of local fields, completions of the number field at the prime ideals. But this is 
no longer in general isomorphic to the Galois group of the maximal abelian extension of the 
number field, which is the symmetry group one wants to recover. It appears that one must 
generalise the approach of Bost-Connes so that the extremal KMS states, when restricted to 
the basic symbols generating the C* -algebra, take values in the maximal abelian extension 
of the number field. The action of the symmetry group on these extremal states should then 
induce the action of the Galois group of the maximal abelian extension on these values. 

Another feature occurs in the analysis of the proof of part (1) of Theorem 1. One can 
treat the infinite places in a similar way to that already described for the finite places, so 
working with the (full) adeles A and (full) ideles J. The ring of adeles A of Q consists of 
the infinite vectors (floo,ap)p indexed by the archimedean place and the primes p of Q with 
Op 6 Zp for all but finitely many p. The group J of ideles consists of the infinite vectors 
(noojWp)p with Woo € R,tioo ^ 0 and Up £ Qp,Up ^ 0 and |up|p = 1 for all but finitely 
many primes p. There is a norm | • | defined on J given by |u| = |uoo| lip I^pIp* We have 
natural diagonal embeddings of Q in A and Q* = Q \ {0} in J induced by the embeddings 
of Q into its completions. Notice that by the product formula Q* C Ker| • |. We define an 
equivalence relation on A by o = 6 if and only if there exists a g € Q* with a = qb. With 
respect to this equivalence, we form the coset space X = A/Q" . The ideles J act on A by 
componentwise multiplication, which induces an action of C = J/Q" on X. Notice that this 
action has fixed points. For example, whenever an adele a has Op = 0 it is a fixed point of 
the embedding of into J (to ^p G one assigns the idele with 1 in every place except the 
pth place.) On the other hand, every Type IIIi factor has a continuous decomposition, that 
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is it can be written as a cross-product of R with a Type IIoo factor. Connes has observed 
that the von-Neumann algebra of Type IIIi in the region 0 < /3 < 1 of Theorem 1 has in 
its continuous decomposition the Type IIo© factor given by the crossed product of L°°{A) 
by the action of Q* by multiplication. The associated von Neumann algebra has orbit space 
X^A/Q^. 

The pair (X, C) plays a fundamental role in Connes ’s proposed approach to the Riemann 
hypothesis in [5]. We remark that Connes *s approach does not as yet reprove the established 
analogue of the Riemann hypothesis in this case: if it did so this would provide an alternative 
proof. Indeed, even for the case of the projective line, where the Riemann hypothesis is 
trivial, the approach of Connes demands an analysis not only of the counting of points over 
finite fields on the projective line, but also an understanding of the moduli space of vector 
bundles over the projective line, due to the fact that all the characters of the id^le class group 
also intervene. In order for his approach to work, Connes would need to be able to prove 
an asymptotic formula for the trace of the action of C on X. A “spectral” analysis of this 
action, related to older ideas of Weil [17] and Tate, shows it to be related to the non-trivial 
zeros of the L-functions with Grossencharacter, and a heurisitic “geometric” analysis relates 
this action to the Weil distribution. The “geometric” heuristics aim at suggesting a proof 
for the positivity of the Weil distribution which is known to imply the Riemann hypothesis. 
In the subsequent sections we give a leisurely introduction to these ideas. 



6 A trace formula over the reals 



There is a rigorous local version of the geometric side of the asymptotic trace formula of 
[6]. For simplicity we work at the infinite place, the discussion at the prime places being 
analogous, and justify the formula in terms of the simple computation of the distributional 
trace. Consider the action of R* = R \ {0} on R by multiplication. This induces the action 
on smooth functions on R, 

U{u)g = gou-\ uGR*, p € (7~(R). (14) 



We average this action over a function of rapid decay. Namely, for h G «S(R*), we let U{h) 
be the operator in L^(R) given by. 



U(h) = f hiu)U{u)d*u, 

Jr* 

where the multiplicative Haar measure d*u is normalised by, 

d*ii~logA, A -> 00. 



/ 



|w|€[l,A] 

The associated kernel of U{u) is 

k{x,u~^,y) = S(y-u~^x), 



and for u ^ 1, 

f k{XjU~^ yx)dx = \1 — . 

Jr 

If h £ S{R*) and h{l) = 0, we define the distributional trace of U{h) to be. 



(15) 



(16) 

(17) 

(18) 



“Trace” ([/(/i)) = / 
Jr- 



h{u *) 
|l-u| 



d-u. 



(19) 
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To state an ordinary trace formula in the general case h(l) ^ 0, we introduce the principal 
part of the integral in (19), defined by 




A(u-‘) 
11 -u| 



d*u =< Ly 



h{u 



>, 



( 20 ) 



where L is the unique distribution on R which agrees with for u 7 ^ 1 and whose Fourier 
transform vanishes at 1. If g(u) = h{{u 4* l)~^)/|u -f 1|~^, then from [ 6 ], §V, we have the 
formula 

/ t p 



Let A > 0 and let P\ be the projection given by multiplication by the characteristic function 
of the set 



e : ^{x) = 0, for |j:| > A}. (22) 

Let F denote the Fourier transform in L^(E) and P\ = FP\F~^. Let R\ = PaPa- In [ 6 ] 
the following is proved. 

Proposition 1 // A G «Sc(K*); then RAU{h) is a trace class operator in L^(R) and as 
A -> oo lye have the asymptotic formula 

TVace(flAl/(h)) = 2A(1) log A + f + o(l). (23) 

J |l-u| 

An analogous result holds for the local fields and also if one considers a finite number 
of plaees at a time. The problem is that the error term then implied by the “o(l)” is not 
uniform in the set of primes chosen so that one cannot pass to the limit over all primes. 
This seems to represent a deep difficulty of an analytic number theoretic nature. 

7 The Polya-Hilbert space 

Connes proposes a Polya-Hilbert space for the problem in the following way. Let S{A)o 
denote the subspace of S{A) given by, 

5(A)o = {/ e 5(>1) : m = I fdx = 0}. (24) 

Let E be the averaging over Q* operator which to / G S{A)o associates the element of S{C) 
given by 



EU){u) = 



(25) 



For <5 > 0, let L^(A’)o,<s be the completion of «S(A)o with respect to the norm given by, 

ll/ll? =/ W)(«)l'“(l + log' M)'/Vu, for / 

JC y|u|e[i,Al 



d*u:iilogA, A -> 00 . (26) 
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If g(x) = f{qx) for some fixed g € Q^, then ||p||j = ||/||^ and so one sees that this norm 
respects, in this sense, the passage to the quotient . We define L'^{X)s by the short 
exact sequence 



0 L^(X)o,s L^{X)s C e C(l) -V 0. (27) 

When 6 = 0 we write L^{X)o and Li^(X) for the first two terms. Here C is the trivial 
C-module and C(l) is the C-module for which n € (7 acts by |u|, where | • | is the norm on 
C. Multiplication of C on A induces a representation of C on Li^(X) given by, 

(i/(A)0(x)=^(A-‘x). (28) 

We introduce a Hilbert space Hs via another short exact sequence, 

0 L2(X)o,6 -4 ^ 0, (29) 

Here L'^(C)s is the completion with respect to the weighted Haar measure as in(26), where 
we write L^(C) when <5 = 0. The spectral interpretation on Hs of the critical zeros of the 
L-functions in [5] relies on taking 5 > 0. Indeed, this is needed to control the growth of 
the functions on the non-compact quotient X : ultimately this parameter is eliminated from 
conjectural trace formula by using cut-offs. It is important here to use the measure |uld*u 
(implicit in (26)) instead of the additive Haar measure dx, this difference being a veritable 
one for global fields, where one has dx = lime_yo £\x\^'^^d*x. 

The regular representation descends to Hs (it commutes with up to a phase as an easy 
calculation shows) and we denote it by W, Connes describes (Hs,W) as the Polya- Hilbert 
space with group action for his approach to the Riemann hypothesis. He proves in [5] and 
[6] the following remarkable result relating the trace of this action to the zeros on the critical 
line of the L-functions with Grossencharacter. 

Theorem 2 For any Schwartz function h € S{C) the operator 

W{h) = f W{u)h{u)d*u 
Jc 

in H is trace class, and its trace is given by 

'n'ace(H^(/»)) = ^ KxJs) 

L(x,^+i«)=0,«eR 

where the sum is over the characters x of C with the multiplicity of the zero being counted 
as the largest integer n < |(1 -I- 6) with n at most the multiplicity of \ is as a zero of 
L{XfZ). Moreover, we define, 

h{x,z):= [ h(u)x{u)\u\^dru. 

Jc 

Now, the action of C is free on L^((7)5 so that the short exact sequence (29) tells us that 
the trace of the action of C on Hs should be, up to a correction due to a regularisation, the 
negative of the trace of the action of C on L^(X)o^s- Prom (27), we see that the regularised 
trace of the action of C on L^{X)s should involve the sum of the corresponding trace on 
L^(X)o ,<5 and the trace on C © C(l). Therefore the regularised trace of the action of C on 
L^(X)s should involve the trace of the action on C © C(l) minus the trace of this action on 
Hs. This minus sign is crucial for the comparison with the Weil distribution. 
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The following result going back to Weil [17] shows how the operation E in (25) brings in 
the zeros of the L functions in the critical strip and provides the key to proof of Theorem 2 
and the appearance of the non-trivial zeros of the T(x, s) in Proposition 3. It indicates that 
the non-trivial zeros of the L-functions should “span” Hs as they are “orthogonal” to the 
image oiE 

Proposition 2 Let x a character on C. For any p G C with Re(p) have 

f ^(O(ii)xMKd*w = 0, forallf g5(A)o 
Jc 

precisely when L(x, ^ + p) = 0. 

8 Relation to the Weil distribution 

In [6] Connes works with a cut-oflF in order to avoid the parameter J of §7. He introduces 
a family of subspaces Ha,o of depending on a real parameter A > 0, such that 

E{B\,o) C Sa where, 

Sa = e L\C) : e(u) = 0, for |u| ^ (30) 

In the case of global fields of positive characteristic it makes sense to introduce J5 a,o as 
the space spanned by the / G S{A)o that together with their Fourier transform vanish for 
|x| > A. In the number field case, all such functions (on the reals) are trivial, and Ba,o is 
described in terms of prolate spheroidal functions. This aspect of [6] is very technical and 
we do not attempt to go into it here as it offers little additional insight into the underlying 
ideas. As the asymptotic trace formula involving them is conjectural, we assume for our 
present purpose the existence of an appropriate family of Ba,o- Let Qa,o be the orthogonal 
projection onto Pa.o and q = EQ^^oE~^. By assumption, 

Oa.o<*5a, (31) 

where S\ denotes the projection onto the set Sa in (30). Therefore, for all A > 0, the 
following distribution is positive, 

Aa(/) = Trace((5A - Ok.o)^(/)), / ^ (32) 

where V is the regular representation of C on L^{C) and V(f) = f{h)V(u)d*u. The 
positivity of Aa signifies that for / G S{C) we have 

Aa(/*D>0, (33) 

where f*{u) = /(u“^). Therefore the limiting distribution is also positive, 

Aoo — Ihn Aa > 0. (34) 

A— ^oo 

In [6] the following conjecture is formulated, together with a proposed construction of the 
family Ba,o- 

Conjecture 1 One can find subspaces Ba,o of L^(X)q such that the distributions Aa are of 
positive type and converge to the Weil distribution^ so that for h G S{C) of compact support, 

Aco W = [ /»(u)(|u|i/2 + - 5^ /■' (35) 

where v runs over the places of Q. 
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It is known that the positivity of the Weil distribution ^ which would follow from Conjecture 
1 using (34), implies the Riemann hypothesis (see for example [14]). One can view Aa, via 
(32), as a distribution associated to the regularised trace of the action of C on the Hilbert 
space Hs of §7 with the cut-off at |u| = A enabling one to take <5 = 0. The factor in 
the integrands of (35) as opposed to those of (23) is due to the passage to the quotient by 
the image of J5. 

We can reformulate Conjecture 1 in terms of the sequence of projections Q\ where Qx 
is the projection in L^{X) onto 5 a, o © C 0 C(l) as follows. 

Conjecture 2 There is a sequence of closed projections Qx, A > 0, in L^{X) extending 
the Qx,o above such that for h € S{C) of compact support we have, as A — > oo, 

Ttace(QAU(h)) = 2/i(l) log' A + V + °(1)> (36) 

y jQt 

where the sum is over the places v of Q. Here 21og^ A = /|;^|gjA-VAj which is asymptotic 
to 2 log A for the correct choice of Haar measure d*u on C. 



9 Relation of the limit distribution to the zeros 



The distributions Aa can be regarded as a sequence of distributions attached to the Polya- 
Hilbert space and action {Hs,W) of §7. We have the following interpretation of Ao© in 
terms of the zeros of the L-functions with Grossencharacter: compare with Theorem 2 and 
notice how the use of the asymptotics of the cut-off has eliminated the parameter <5. 

Proposition 3 Let h be a function of compact support on S{C). Then, 



/\oo(h) = '£mx, 

x,p 




(37) 



where the sum is over the pairs (x, p) of characters of C and zeros p of L(x, \ + p) ^ith 
Re{p) €] — |, |[. The number iV(x, | 4- p) is the multiplicity of the zero, the measure dpp{z) 
is the harmonic measure of p with respect to the line iE C C and the Fourier transform h 
of h is defined by 

Hx.p)= f h{u)x(u)\u\^d*u. (38) 

Jc 

In [6] Proposition 3 is proven in the function field case, and the proof for number fields 
is outlined. It allows one to derive under Conjecture 1, the following conjectural explicit 
formula. 



Conjecture 3 Let h € S{C) he of compact support, then 




h{u) 

\T=V\ 






= f + l“l N(x, l+p) [ Mx, z)dHp!,z), 

Jc •7^6»R 



where v runs over the places of Q. 
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Proposition 3 gives an interpretation of the non-trivial zeros of the Riemann zeta function 
with the non-critical zeros appearing as resonances for the harmonic measure with respect 
to the real line. Conjecture 3 is the globed analogue of Proposition 1 and can be thought 
of as a conjectural trace formula for the action of C7 on L^{X). In accordance with the 
comments preceding Proposition 2, one sees that for the action of C on the critical 

zeros would enter the trace formula with a minus sign, which Connes interprets as their 
being an absorption spectrum for the action of C on A/Q^. This represents an important 
new feature over other proposed spectral models to date coming from physics. It is in 
complete agreement with the intuitions gleaned form the proofs in the function field case 
that yield a spectral interpretation of the zeros as the eigenvalues of the action of Frobenius 
on /-adic cohomology. The analogue of the Polya-Hilbert space is given, in the curve case (if 
one replaces C by Q , with I ^ p where p is the base field characteristic) by the 1st degree 
etale cohomology group on the curve with coefficients in Qi. This group appears with an 
overall minus sign in the Lefchetz formula. In other words, the spectral interpretation of 
the zeros of the Riemann zeta function should be as an absorption spectrum rather than as 
an emission spectrum, if one uses the language of spectroscopy. 

10 A global geometric trace formula 

One of the most interesting heuristic aspects of Connes ’s ideas, which appears already in 
[5], is the interpretation of the global analogue of Proposition 1 as a non-commutative trace 
formula for a flow given in Conjecture 2. In [5], the validity of Conjecture 2 was interpreted as 
the existence of a noncommutative generalisation of a distributional trace formula for flows 
on compact manifolds (see [9]). If M is a smooth compact manifold with an everywhere 
non- vanishing vector field f, then the associated flow is given by the 1-parameter group 
{Ft = expffjteR which induces an action of R on smooth functions on M by, 

U{t)f = foFu feC^(M). (39) 

For h 6 S{M), consider the operator, 

U(h) = f U{t)h(t)dt, (40) 

Jr 

where we suppose for simplicity that h{0) = 0. Then we know from [9] that the distributional 
trace is given by, 

“Trace” (f/(/i)) = ?/». (41) 

where the sum is over the primitive periodic orbits ir and where is the isotropy group 
of any x £ n with measure d„t normalised so that has covolume 1. We have denoted 
by Pt the restriction of the differential d{exp{t^))x to the space transverse to the orbits (the 
Poincare map) and by |1 — Ft | the absolute value of the determinant of 1 — Pt which is 
assumed non-degenerate. Now, observe the following fact about the pair (X, C) proved in 
[ 5 ]. 

Lemma 3 For x £ XjX ^ 0, the isotropy group of x in C is compact if and only if there 

exists exactly one place v of Q with = 0 where (xooiXp)p is any lift of x to A. 

U X £ A with Xt, = 0 at only one place v of Q, then the isotropy group of x in C is 

isomorphic to and the transverse space is isomorphic to Qv • Hence the analogue of the 
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Poincar4 return map for this fixed point is multiplication of QJ on Qv whose trace formula 
we discussed in §6 for v = oo. We therefore write suggestively |l — Pv| = |l~i^|> ^ ^ QJ* 
Let h e S{C) with compact support and /i(l) = 0. The analogue of the trace formula 
(41) becomes, once we agree to single out the fixed points with compact isotropy group 
determined by Lemma 3, 

“TVace” (U(h)) = E / . 

This is consistent with the local computation of §6 when /i(l) = 0 and introduces the sum 
over local terms in (35) as coming from an analogue for A/Q*' of a geometric trace formula 
for flows. 

These heuristics show that the validity of the Riemann hypothesis would imply that the 
above analogue of the distributional trace formula for flows on manifolds makes sense for 
the pair (X^C). This is remarkable in light of the fact that the space X displays none of 
the regularity properties needed in the proof of the manifold case. 

11 Concluding remarks 

The work of Bost-Connes and Connes leads naturally to investigating (^4/0^,^) as the 
geometric site and (IlsfW) as the spectral site for the study of the generalised Riemann 
Hypothesis. Although much of Connes ’s suggested approach to the Riemann hypothesis 
is conjectural, it seems likely that a deeper understanding of the space A/Q^ should have 
interesting consequences. The analogy between the trace formula for flows on smooth com- 
pact manifolds of [9] and the conjectured trace formula in (42), suggests the interpretation 
of Weil’s explicit formula as a trace formula on A/Q*. It should be noted that Paul Cohen 
proposed in unpublished remarks in the 1970’s that in order to understand the Riemann 
Hypothesis one should understand the nature of measurable Q* -invariant functions on A. 
The fact that the von-Neumann algebra associated to A/Q* is of Type IIoo, as shown by 
the work of Bost-Connes, indicates that it is not possible to do classical measure (Type I) 
theory on A/Q*. Non-classical ideas are therefore needed and the work of Connes is a bold 
step in that direction. 

Some very recent ideas of Connes expressed at the AMS/Clay Institute conference on 
Noncommutative Geometry in Mount Holyoke in June 2000 indicate that he may be able to 
reexpress the conjectured trace formulae in terms of indices associated to Fredholm modules 
in the sense of Connes ’s noncommutative geometry. This would lay the investigation open 
to the techniques of that theory. 

To date, no spectral interpretation of the zeros, even in the function field case, has 
yielded in itself a proof of the Riemann hypothesis: something else is always needed. In the 
function field case this extra ingredient is Castelnuovo positivity. In one of Weil’s proofs 
of the curve case over finite fields, he uses the explicit formula (which Connes conjectures 
can be described as an appropriate trace formula) together with the proof that the Weil 
distribution is positive. By contrast, Connes in [6] worked by construction with the Hilbert 
space ffs where positivity was ensured by definition and conjectured an appropriate explicit 
formula, thereby reversing in some sense the logic of the proofs in the curve case. In the 
more recent approach [7], positivity is part of the problem. One constructs a sequence of 
projections which are positive and the difficult part is to prove that their limit gives the 
Weil distribution. The spectral interpretation is there, but once more it is jiot enough. 

The conjectures and analogies in Connes ’s work seem to offer much more than an ap- 
proach to the Riemann hypothesis. They suggest a fruitful interaction between the theories 
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of operator algebras and C^-algebras and the analytic and algebraic theories of numbers. A 
likely source for the immediate future of a new input into number theory is Connes’s outline 
of a noncommutative Brauer theory given in [7]. Even if the Riemann hypothesis is proven 
by techniques independent of those proposed by Connes, the set-up he is proposing could 
then step in armed with the validity of the Riemann hypothesis to view arithmetic in a new 
and exciting way. An immediate challenge is therefore to understand and exploit Connes’s 
approach in the function field case where the Riemann hypothesis is known. This has not 
even been carried out for the case of the projective line where the analysis should be most 
tractable. 
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A HYPERELLIPTIC CURVE 
WITH REAL MULTIPLICATION OF DEGREE TWO 

Harvey Cohn 

Abstract. The analogue of complex multiplication in an elliptic curve is illustrated for a 
hyperelliptic curve with real multiplication of degree two (over C). Humbert’s equation, 
which characterizes such a curve, was derived in 1899 by elegant tour de force of contemporary 
analysis and geometry; but this equation can be derived very simply by use of computer 
algebra on the abelian integrals, leading directly to a sufficient proof of Humbert’s equation 
from his remarkable conic configuration. A set of suitable hyperelliptic coefficient parameters 
is also introduced. The relation to Hilbert modular functions is outlined in an appendix. 



0. Summary. This all began with the AGM (arithmetic-geometric mean) of Gauss 
(1799), (see [l], [5]), which became classically interpreted as a mapping of one elliptic curve 
in two-to-one fashion onto another. Since elliptic curves have one (coefficient) parameter, 
this means, loosely speaking, for some s sq' 

{y* = x{x - l)(i - 5^)} -t {vo = xo(xo - l)(io - ■s?)}- 

Then when s = sq, the curves are the same so these (discrete) values of s become “singular 
values” with “complex multiplication.” (See §1-4 for definitions and other details). 

If we deal with hyperelliptic curves (of genus two), then there was a corresponding AGM 
found by Richelot (1837), (see [l]), also later interpreted as a correspondence between 
two such curves (giving rise to an isogeny between jacobians with kernel isomorphic to 
Z/2Z X Z/2Z). Since these curves have three (coefficient) parameters (see [3]), this means, 
more loosely speaking, for some (s,t,u) — ► (so,to,wo): 

{y* = i(x- l)(x - s*)(x-t^)(x-u*)} {yo = io(xo - l)(io - «o)(®o - <o)(3;o - u*)}. 

This can not be a two-to-one mapping as in the case of genus one, but a mapping of abelian 
integrals in a manner described below. We then have a relationship due to Georges Hum- 
bert (1899), (see [10]), reducing the curves to two parameters, which he called “singular 
cases” with “real multiplication.” (See §5-7 for definitions and other details). 

Either complex or real multiplication is “singular” in the sense that there is a loss of 
dimension of the parameters. 

The real cases were derived by Humbert as a culmination of nineteenth century state 
of the art, with the use of Kummer surfaces, theta functions, and projective invariants. 
In this paper, we shall justify Humbert’s results by computer on a more elementary and 
(basically) self-contained level, by working with the abelian integrals using MAPLE. 

Presented to the New York Number Theory Seminar under the title "Analogies between real and complex 
multiplication.” 

Mathematics Subject Classification. Primary 14K22, 11F41. Key words and phrases. Complex multiplication, 
real multiplication, Jacobi manifold, Humbert’s criterion. 
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Actually, the two singular hyperelliptic parameters are algebraically related to the two 
variables of the field of Hilbert modular functions for Q(\/2) (as developed by Hecke and 
Siegel, see [7], [6] [4], [8], [16], [18]). This relation apparently is not known explicitly, but 
no further use is made of Hilbert modular functions here. 

The author had previously worked in this area, mostly on modular equations [4] for 
the (transcendental) Hilbert modular invariants. He is therefore all the more cognizant of 
his debt to Armand Brumer, Mark Heiligman, Everett Howe, Bjorn Poonen, and Jerome 
Solinas for useful conversations which eased the transition to (algebraic) coefficient param- 
eters. 

There is obviously a prepossessing need to know that the transcendental and algebraic 
results are part of the same “language,” and the author notes that Armand Brumer informs 
him of prior (unpublished) results in this area. The author also notes previous papers of 
J.-F. Mestre ([13], [14], [15]) exploring Humbert’s methods and also [3], [17] sources of 
reference. 



COMPLEX MULTIPLICATION OF DEGREE TWO 

1. The period structure. To recapitulate the claissic result, (see [5]), consider a 
lattice generated by two complex vectors, 1 and \/^, 

(1.1a) R = [1, \/^] = {n + m\/^ I n,m € Z}. 

Then for this period structure, U an abelian integral (always of the first kind) is defined 
in the period parallelogram, i.e., 

(1.1b) U e C/R. 

Complex multiplication of R by \/^ produces a sublattice of R. In matrix form, 

(1.2a) i? = (1 v/=2), 

(1.2b) (v/=2)iJ = (v/=2 -2) = R 

If we were looking for multipliers of norm two, we should find two others: 

( 1 . 2 c ) (1 + 0 (10 = (10 (} ■/). 

(1.2d) (i(l + v/=7)) (1 1(1 + V^)) = (1 1(1 + v/=7) (J -/) . 

Of course, we note the above matrices are of determinant 2, and likewise 

(1.2e) AT(v/= 2) = JV(l + 0 = + = 2. 

2. The elliptic curve. If an elliptic curve has a two-to-one mapping onto itself (a 
two-isogeny) , then its period lattice must be equivalent to itself by a complex multiplier 
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mapping this lattice into a sublattice of index two. Again classically, this is an isogeny 
found as follows: 

The elliptic curve is written with (cross-ratio) parameter A(= s^), 

(2.1a) = x{x- l){x - s^), (s^^O.l). 

We write the parameter A as a square with the hindsight that its square root s will emerge 
in the computation. There are many equivalent forms of the elliptic curve (2.1a). The 
y- invariant, defined as 

(2.1b) y[A| = 256(1 - A-f A2)3/A2(A-1)2, 

is algebraic in the coefficients of (2.1a) and it classifies elliptic curves up to isomorphism. 
Note that y[A] is invariant under the transformation group (5s) permuting the points 
{0,1, oo}: 

(2.1c) A A' G (A,l/A,l- A,(A-1)/A,A/(A-1),1/(1- A)} 

If the period parallogram has ratio r (with > 0), then j has the well-known tran- 
scendental expansion in q = exp 27rtV, 

(2.1d) j (r) = 1/q + 744 + 196884^ + 214937609^ + • • • . 

(The direct connection with the periods like r and the coefficients like is not easily 
available in the hyperelliptic case which follows). 

The traditional Gauss-Landen transformation. 



(2.2a) 


(x-l)(x-3^) 
° i(l + s)^ 


(2.2b) 


y(x^-3^) 
x^(s + l)^' 


generates the unique correspondence of xq with the pair {x^s^ /x} which extends to 


(2.2c) 


(xo,yo) (x,y). 


With the new parameter. 




(2.2d) 


5 — 1 

50 = . - , 

5+1 


we verify by substitution. 




(2.2e) 


yo = xo(xo - l)(xo - 5 q). 
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Note that we replace lo by 1 — xo then sq would be replaced by 
(2.2f) 






the ratio of geometric-to- arithmetic means of 1 and s. 

Since x = oo corresponds to xq = oo, we see from (2.1a) and (2.2de) that 

(2.3) i(s + 1) / — = f (modulo periods). 

J oo y J oo yo 

In terms of the j-function, we have the two values 
(2.4a) X = j[3% Y = y[4|, 

and on eliminating s and so using (2. 2d), we obtain the modular equation of order two, 
Y) =) + Y^-X^Y^ + 1488XF(X + Y)~ 162000(X2 + Y^) -f 40773375Xy 

(2.4b) + 8748000000(X + Y)~ 157464000000000 = 0. 

3. Verification of Gauss-Landen. We return to the substitution (2.2a) to see how it 
might be used as a model for the hyperelliptic case where the abelian differentials (always 
of the first kind) are mapped rather than curves. 

We try to reconstruct the relation (2.2a), starting with 

(3.1a) xo = 

X 

This is reasonable since the singularities x G {0, l,oo,s^} go into the subset xq G {0,oo}. 
To find the constant c we try a substitution into the abelian differential of the first kind, 
namely 

^3 _ s/cdx(x - s)(i + ^) 

V^o{xo - l)(io - «o) \/x{x- l)(i - s^)gi{x)q 2 {x) ’ 

where we designate 

(3.1c) Qi(x) = cx^ — cxs^ — cx cs^ — X, 

(3. Id) 92 (ic) = CX^ — CXS^ — CX CS^ — SqX. 

It is now clear that the quadratics qi{x),q 2 [x) can be conveniently taken as perfect squares 
to make sure the right-hand radical in (3.1b) remains elliptic. Thus 

(3.2a) disc 9 i = (cs^ - 2cs -h c + l)(cs^ -f 2cs + c + 1) = 0, 

(3.2b) disc 92 = (cs^ + 2cs + c + 5o)(cs^ — 2cs H- c -f- 5 q) = 0. 

There are four ways the two discriminants can vanish, we choose the second factor from 
each of (3.2ab) and solve each for c. Thus a good choice is 

= -(7W = -(^’ 

and the common value of c also yields the value of sq in (2. 2d). 

In essence we derive the mapping (2.2abd) as one of many mappings for producing a 
relation like (2.S). The parametric relations on the coefficients are only proved sufficient. 
The fact that they cover all cases of degree two is unimportant here since a corresponding 
result is not attempted for the hyperelliptic generalization. 
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4. Singular values. The singular cases of (2.2abd) are those discrete values of the 
parameter s where 



(4.1a) 






so the elliptic curve is the same in (2.1a) and (2.2e). Now (2.3) leads to 



(4.1b) 



dx dx 

fj, I — = / — modulo periods, 

Joo y Joo y 



where /x is a complex multiplier (to be determined below). Thus we have two periods of 
the same elliptic curve with ratio equal to the complex multiplier /x. 

When (4.1a) leads to sq = ±5 then /i == ;(s -f 1). When, because of (2,1c), we use 
A' = 1 — A, or 1 — 5^ = Sq, then fj, = {s + 1) (as the transformation x — ► 1 — x must be 
used to cancel ^ 1 — and this removes the t). The information is all contained in 

this table: 



root value of /x 

s = -l-\-y/2 i{s + l) = x/^ 8000 

s = t i(s + 1) = -1 + 1 1728 

s = (-3 + >/^)/2 s + 1 = (-l + \/^)/2 -3375. 

REAL MULTIPLICATION OF DEGREE TWO 
5. Real multiplication. Here we are dealing with a period matrix arising from 
(hyperelliptic) curves of genus two. There are two independent abelian integrals (U, U') of 
the first kind with four pairs of periods for four independent paths on the Riemann surface. 
Analogously with §1 (above), there are four pairs of period vectors forming a lattice 



(4.2a) 



equation 
So = —5 
So = S 

sl = l- 



(5.1a) R — [R.iy R, 2 i R-3i R-a] — {n-iR.i + U2i2.2 •+■ ^ 4 -^- 4 } (u» E Z). 

Then (7, U' are defined modulo periods as follows: 

(5.1b) {u>) ^ 

To simplify the lattice R, we select linear combinations of U, U' using GL(2, C) to 
produce suitable independent abelian integrals and from GL(4, Z) for the periods to reduce 
(5.1a) to a standard form showing three parameters p,p',p” (subject to sign conditions 
deferred to §13), 

( 5 -lc) ^=(01/ /')• 

(This is possible since the periods are those of abelian integrals). 

The essence of real multiplication is that another application of S' 6 GL(2,R) on R can 
produce a subbasis (as in (1.2b)), 



(5.2a) 



SR = RM, MeGL(4,Z). 




no 



This can happen only in special (singular) cases: Assume, e.g., 2p" = p. Then 

sn. [II V = 

This is called a “real” multiplication because e.,g., the relation 
(5.2c) 5* =( 02 )= 

shows S has real eigenvalues ±\/2- Also the period vectors of SR form a lattice as in (5.1a) 
with the property that 



(5.2d) 



R/SR « Z/2Z X Z/2Z. 



The (absolute) value of the determinant S leads to the characteristic property (5. 2d) of 
lattices (and jacobians). 

6. Hyperelliptic curves. The hyperelliptic curve (always of genus two here), has 
three parameters (again written as squares), 



(6.1a) 



H : = x(x — l)(a: — s^)(x — t^)(x — u^). 



We refer frequently to the “quintic” roots {oo,0, or the Wexerstrass points, 

(which are assumed distinct, of course). The abelian differentials form a linear space of 
dimension two. Generators are chosen for convenience as dU and dU* with 



(6.1b) 



U 



. r(£:^, . r 

j 00 y J 00 



(x H- s)dx 

y 



We are considering a correspondence of H with Ho , where 
(6.1c) Ho: yo = a;o(xo - l)(xo - 5o)(^o - ^o)(^o - Wq)> 

We try the mapping 



(6.2a) 



(i-l)(i-«*) (lo - l)(a:o - So) 

= c . 

X Xo 



It clearly is analogous to (3.1a), but (2.2) fails. For now, we only say 
(6.2b) 

from the correspondence of the pairing 



2 to 2 

Xo < — X, 



(6.2c) 



{io,s*/io} {x,S^/x}. 




Ill 



We may not, however, write an analog for the correspondence (2.2c); the best result is 
blemished by a ± sign: 



(6.2d) (xo,yo) (x,±y). 

6.3 Lemma. A curve C of genus g can not have an N-fold cover by a curve C' of genus 
unless > 1 + N{g — 1). 

The proof follows from the Riemann-Hurwitz formula 
(6.3a) 2(^'-l) = 2iV(^-l) + W, 

where W is the (nonnegative) number of branchings of the covering. We show some simple 



illustrations when N 


= 2: 






g' 


9 


W 


C' C (two-fold cover) 


0 


0 


2 


= - {y = x} 


1 


0 


4 




1 


1 


0 


“Gauss-Landen” 


2 


0 


6 


{y2=i6-l} _ {y=a; 6 _l} 


2 


1 


2 




2 


2 W 


A 

o 


impossible! 



Yet when we construct the differentials (in §10-11 below) we shall see the correspondence 
of jacobians is in one-to-two fashion (without sign ambiguities). 

The correspondence (6.2a) comes indirectly from Richelot who first broached the gen- 
eralization of the Gauss-Landen transformation. His work is reconstructed in [l] with 
an ingenious formalism. The explicit form of (6.2a) is deducible from it but comes more 
directly from Konigsberger [12). In Rohn [19], the two-to-two relation is described, es- 
sentially as a four-to-one mapping, not of curves but of the pair {UyU*} into its image, 
i.e., an isogeny of the two jacobians. We reexamine such matters in the light of Humbert’s 
concepts. 

Our objective for now is to specify c and sq (6.2a) so as to produce singular real multi- 
plication. This will mean a self-isogenous jacobian. 

The field determined by the coefficients of H in (6.1a), namely Q(s^,t^,u^) is not large 
enough for the coefficients of Hq, even the field of the parameters of Hy 

(6.4a) k = Q{SytyU)y 

requires an extension field, [K : k] = 2 (see §7 below), so that 
(6.4b) c,«o,<o.“o € 

7. The transformation parameters. We note that (6.2a) yields two values of x for 
each xo- Rewriting it as 



c{xo + — - 1 

Xo 



2 ) ^ 



(7.1a) 
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we find the conjugates x,x' satisfy 

(7.1b) xx' = 5^. 



We work directly (as in §3) with the differential from (6.1b), 



(7.1c) 



dU 



dx(x — s) 

\/x(x — l)(x — 5^)(x — t^)(x — U^) 



Our purpose is to transform it into a differential in xq. 
From (6.2a), 



(7.2a) 




So, dU may be split into two factors. 




(7.2b) 



dU = dCl A, 



where 



(7.2c) 



dVt = 



dXp(xg - Sl)y/C 
Xoy/xo{xo - l)(xo - 5g) ’ 



A = 



{x + s) y/{x- t^){x - u2) 



Using the conjugation notation of (7.1b), we see x — ► x' leads to dU —*■ dU\ A 
while dfl = dn'. Then invoking Abel’s symmetry theorem, the sum of differentials 



(7.2d) 

dU + dU* = dn(A-f-A'), 

(7.2e) 

A + A' = =f==== + ^ 

(x + s) y/(x — t^)(x — u^) (x' -H s) yj (x' — t^)(x' — u^) 

should be a differential on Hq in (6.1c) from the mapping (6.2b). 

It is a well-known trick to write the sum of radicals as a radical, e.g., 



(7.3a) 



[A + A') = Va^ + +2AA'. 



Consider the internal radical. 



(7.3b) AA' = . 

(x + s)(x' + s)y/{x - ^2)(x' - t^){x - U^)(x' - U^) 

Use is now made of the symmetric functions 

(7.3c) xx' = 5^, x + x' = 5^ + 1 + cxo - csg — c -j- csg/xo (= <7i). 
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As in (3. led), we obtain quadratic expressions which we make into perfect squares, such 
as 

(7.4a) (x — - t^) = = q\{xo)/xo, 

where 



(7.4b) Qi{xo) = —t^exQ — xq(—s^ — I^cSq — t^c) — I^csq 

The discriminant of gi(xo) vanishes, i.e., 

(7.4c) A(t) = {-t^csl + 2i^cso-t^c-^t'^-t^-s’^^t‘^s^) 

(7.4d) {-t^csl-2t'^cso-t^c-^t^ = 0. 

If we worked with the factor {x — u^){x' — u^) instead, we should obtain similarly A(u) = 0. 
In either case there is the symmetry sq — ► —sq. 

If we solve for c in the (first) factor (7.4c) from A(i) (as shown), and if we also solve for 
c in the (second) factor (7.4d) from A(u), we obtain two equations 

(7 /. - ~ _ (-1 -h ^)(^ + I)(^ ~ ^)(^ + ^) 

^ [so- l)H^ ~ {-so - l)^u2 

The final result is now seen. We need K = k(uj) where 
(7.5a) -u^). 

Then, comparing the two fractions in (7.4e), we find 

. . _ ut^ — ut^ — us^ + ut^s^ + cjt 

ut^ — ut^ — us^ ut^s^ — tjt' 
and by taking the mean of the values in (7.4e), we find 

«u(5§-l)- 

Finally, AA' is rational (no square roots), i.e., 



— rof 

q2(xo)tuc{xo - sq){xo + So) ’ 
q2(^o) = CXq + 2sXo -f -f Xo - CSqXo — exo + CSq, 



It would seem purely “routine” as a problem in computer algebra (e.g., MAPLE V) to 
find the square root of A^+A'^±2AA' but unfortunately the field of K = k{u) presents too 
many difficulties because of the four parameters The fact that cj is a dependent 

parameter makes it even harder! (The persistant error message is “object too long”). 

We could resort to the parallel formalism of Richelot to find expressions for tg 
Uq, using the four parameters, s,i,u,u; (and an additional radical), but there is no exact 
analogue of the Gauss-Landen transformation (2.2ab). 

We do not show this work here, as our purpose is to specialize such results to the singular 
case only. It will be characterized by s = sq although there are many equivalent ways, 
e.g., 5 = 1/^0 > 5 = ^ 0 ) etc. (Recall that we only claim a sufficient condition for real 
multiplication.) 
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HUMBERT’S CRITERION 

8. Humbert’s equations. We shall derive Humbert’s criterion [10] as a sufficient 
condition for real multiplication of degree two. Humbert showed it is also necessary, (by 
stronger techniques using the Rummer surface and the theta-function). 

We start with the condition that s = sq. Then eliminating u in (7. Sab), we obtain the 

(8.1a) resultant = (t^ — l)(s^ — ^^)/io ^2 (= 0)* 

Here ho = 0 and /12 = 0 are two of these four equations generated under changes s —*■ 
— — ► — t,u — > — u: 

(8.1b) ho = s^t — s^u — st — su t^su + t^u + su^t — 

(8.1c) hi = s^t — s^u st su — t^su + t^u — su^t — 

(8. Id) h2 = —s^t — s^u st — su t^su + t^u — su^t 

(8.1e) hs = s^t s^u + st — su t^su — t^u — su^t — u^t. 

Thus, if we are dealing with the actual coefficients ( 5 ^, u^), we should have to multiply 

all four and get Humbert’s equation, 

jET(5^, = hohih2hs = 2s^u^t^ 12s^u^t^ — 

- + 2s®U^ - 

- + 2u^t®5^ -f 12u®t®s^ + 12u^t®5^ + 2s"^uH^ 

(8.1f) + 2 s^uH^ + 2s^u^t^ - + 12s%^t^ - 

-f + 2u^t^s^ + 12u^i^s® 

- 4- 2s^u^t^ + 12s^u®t^ - 

- 16s®u^< 2 + 2t^5®u2 - (= 0). 

Curiously, Humbert failed to use square parameters, so his equation did not have this 
factorization. It was written (essentially) as homogeneous of degree eight. 

To see the symmetries involved, number the branch points of the hyperelliptic curve 
(6.1a), and introduce v(= 1) for homogeniety. (Thus s —* s/v,t t/v^u —> u/v in (8.1b-f) 
with the ho,- - • ,hz and H replaced by their numerators v^ho, --- , v^hz and v^^H). Then 
the (coefficient) parameters are 

, X “quintic” roots 0 00 

* * vertices 1 2 3 4 5 6 

Accordingly, the equations (8.1bcde) and (8. If) are invariant under the dihedral group (of 
order eight): 

(8.2b) D4 = < (3456), (35) > . 

Therefore, with 4!/8 = 3 equations conjugate to (8. If), and 15 ways of selecting the two 
points { 1 , 2 } for 0 and 00 , there is a total of 45 equations of degree 8 in the quintic 
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coefficients which constitutes Humbert’s criterion. The process of defining the manifold 
of real multiplications of degree two itself is far from ecLsy. (For the total manifold of 
hyperelliptic curves, see [11]). 

A simplified version of the factors of Humbert’s equation, can be given with the help of 
the symbol 

(8.3a) [ai] = 

Then with v(= 1 for homogeniety) , we can see 
(8.3b) ho = 5tui;([ts| + [su] + [uv\ — [vt]), 

(8.3c) = uv[t^ — s^) 4- vt[s^ — u^) + ts(u^ — v^) — su{v^ — t^), 

and (ignoring sign), all come from ho by the cyclic permutation of (v,t,s,u), 

from (8.2b). 

Finally, from ho = 0, we obtain a rational parametrization, using 
(8.4a) r = ut, 



then with only two parameters r and s, we express all coefficients of the quintic, i.e.. 



(8.4b) 

(8.4c) 



r — rs 4- 5^ + 5 

jr 

r 4- rs 4- 5^ — 5 
r 4- rs 4- 5^ — 5 
r — rs 4- 5^ 4- 5 



(= 

(= 



Oi), 

aa), 



and the singular quintic (from (6.1a)) now becomes 



(8.4d) 



= x{x — l){x - s^){x — ai){x - 02 ), 



9. Humbert’s geometric criterion. From the singularities of Rummer surfaces, 
Humbert was able to express his criterion as a property of conic sections. 

9.1 HUMBERT’S CONIC CONSTRUCTION. Take an arbitrary (nondegenerate) conic C in 
the x,yyZ projective plane and parametrize it in any way, e.g., 



(9.1a) C: z:x:y = <^o(^y) : <^i(iy) * <^ 2 (^), 

where <l>j(w) represent suitable polynomials of degree at most two. Then let {1,2, 3, 4, 5, 6} 
represent the points found by substituting for w the values from the table (8.2a)y (with 
V = 1). Construct the tangents at all six points. The tangents at 3, 4, 5, 6 (in that order) 
form a quadrilateral by the successive intersections, 34,45,56,63, as in Figure 1. 

Then the condition for real multiplication of degree two is that another conic C exist 
which has a common tangent with C from points I and 2 and which passes through the 
vertices of the quadrilateral 34,45,56,63. 

The details were omitted by Humbert, but we can make a straightforward computation 
to verify that the conic construction is equivalent to Humbert’s equation (8. If). 
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Figure 1, Humbert's Criterion 



Actually, all conics have (in C) parametrizations projectively equivalent with each other, 
hence with the simplest: 

(9.1b) C : y = x = (z = l), 

(in affine coordinates). We note that on C, 

(9.1c) {tangent line at (a, a^)} = {y -h = 2xa}. 

The intersection (x,y) of the tangents at {gi^gl) and (y 2 »fl' 2 ) 

(9.1d) {y + ffi = 2ig,} n {y + 92 = 2X92} = (^^-^-^,9192). 

Therefore we look for the second conic C' having the common tangent to C at oo and 
0. These two conditions leave three parameters in 
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(9.2a) C' : (g(x,i/) =) + 2i4x + -h 2Gy = 0. 

The parameters A, By G result from the common tangencies as follows: 

( 9 . 2 b) (x,y)->oo => ^(x,y) w (x + 5 y)^, y -► 0 => q[Xyy) « (x + A)^. 

We now have four equations for the three unknowns A,jB,G: 

(934 =) ,<^) = 0, (946 =) = 0 

( 9 . 2 c) (966 =) 9(^-y^,S*U*) = 0 , (963 =) 9( “ - y - ,u^) = 0 . 

We first eliminate Gy (which is linear), between the three successive pairs of the equations 
934 ,^ 45 , 956 ,^^ 63 , SO 934,945 result in (say) ^345, etc. 

We thereby obtain a trio of equations, 9345, 9456, 9563, nonlinear in A and B. Now we 
have a choice of which of A or -0 to eliminate next. 

So we eliminate A between the two successive pairs and 9345,9456 result in (say) 93456, 
etc. We now have the two equations 93456,94563* The remaining variable, namely 0 , is 
then eliminated from this one last pair to obtain the (partial) “resultant-( 7 A 0 .” 

We next eliminate 0 first (before A) so we obtain the corresponding (partial) “resultant- 
G 0 A.” 

It turns out that each of the partial resultants has extraneous factors, which are elimi- 
nated by comparison. Thus to within a numerical factor, 

( 9 . 2 d) gcd(resultant-GA 0 ,resultant-( 70 A) = 0 (s^,t^,u^). 



INTEGRAL IDENTITIES 

10 . Transformation of abelian integrals. We complete the identification of the 
singular case ( 8 . 4 bcd) by showing the two-to-two mapping of each of the hyperelliptic 
curves H and Hq of §6 and the mapping of the abelian integrals. 

Now with s = SQy the computer algebra simplifies sufficiently. We state the results and 
sketch the computations in §11. Starting again with a relation of type (6.2a), 

(lO.la) (x-l)(x- 3 ^) ^ ^ (XQ - lj(xo_^ 3 ^ ) ^ 

X Xq 

and we look for a constant to provide the self-mapping of the abelian integrals. We shall 
show in the next section 



(,2_6^)(r + 3^)(r-l) 
r(r — rs -}- 52 + s){r -f- rs + — 5) ’ 



(10.1b) 



AC 
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We shall also see that the elliptic curves are 

(10.1c) H: = x(x — l)(x — 5^)(x — ai)(x — 02), 

(lO.ld) Ho: Vo = xo{xo-l)(xo- s^){xo-bi)(xo-b2), 

where ai and 02 are given in (8.4bc) and 61 and 62 are derived by the involution 

(lO.le) r 5^(1 - r)/(r -H 5^), s s, 



which transforms 



(lO.lf) 



(r - l)3^(r -f g) 
(r + s2)(r-s) 



61, Oj 



(r - l)a^(r -3) 
(r + ^ 



62. 



We further note from (10. la), for given xq (or x) that two conjugate values of x (resp. 
xo) are uniquely determined by 



( 10 . 2 ) 



xx' = 5^, XqXo^ = 5^, 



while at x' (or xqO the corresponding values y' (resp. yo') is determined with ambiguity 
of sign. 



10.3 Main Identity. The abelian differentials at the conjugate points add as follows: 



(10.3a) 

(10.3b) 

(10.3c) 

(10.3d) 



dxo(xo-5)/yo + dxo [xq - s) ! yo* — 1 \ dx{x - a\)!y, 

dxo[xQ-s)lyo - dxo [xo - s) j yo' = dx* {x* - ai) / y* , 

dxo[xo + s)!yo H- dxo'(xo' + 5)/yo' = 72 dx(x - a2)/y, 

dxo{xo + s)!yo - dxo[xo+s)lyo = 72 dx'(x' - a2)/y'. 



(10.3e) 



Ti = -- 






The constants satisfy 

_ (r rs + ~ s)(r^ - s‘^){r 4- ^ 

(r — rs + 6^ -f 6)rs2(r — 1) 

Note that the relation between ( 10 . Sab) and (lO.Scd) is s —5, leading to (ai,a2) 
(02, ai) and -yi -+ 72- 



(r^-5*)(r + s2) 

^172 = 2/ — 

rs^{r — 1) 



We must consider the ambiguity of signs. First of all, 71 has an arbitrary sign, but it 
determines 72. Furthermore yo and yo* can also be chosen arbitrarily, but then yy* h2is a 
sign determined by the product of (10.3a) and (10.3b) (which cancels the ambiguity of yo 
and yo'). The signs are consistent for (I0.3cd). 

We define the jacobians of H and Ho on each Riemann Surface (modulo periods of 
course) as column vectors with components consisting of abelian integrals with upper 
limits P = (x,y), Pq = (xo,yo), etc., and some convenient lower limit, say 00. Thus 

(10.4a) 

H: u(P) = 'll j dx{x- ai)/y, v(P) = '12 J dx(x-a2)/y, 
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(10.4b) 

/ Po rPo 

dxo(xo - s)/yo, vq(Po) = / dxo{xo -f- s)/yo. 

Thus the jacobians are the column vectors 

(10.4c) H: J[P) = (u(P),t;(P))^^ 

(10.4d) Ho- MPo) = (u(Po),t^{Po))‘^ 

Note that a change of (say) (x,y) — ► (x,— y) will cause u — + — u, v — > — v, J{P) 
-J(P). 

10.5 Mapping of jacobians. Theorem (10.3) may be rewritten with (10. Sac) and 
(lO.Sbd) grouped as follows: 

(10-5a) do{xQ,yo) + Jo(xo\yo) = J[x,y)y 

(10.5b) Jo{xo,yo) + Jo{xo\-yo') = J{x\y'). 

The Jacobi inversion theorem leads to 
(10.5c) 

(u(x,y),v(x,y)) {(uo(xq, yo), vo(xq, yo)), ( 1 ^ 0 ( 2 :^, yo), vo(xo, yo))}, 

(lO.Sd) 

(u(x',y'), v(x',y')) -► {(uq(xo, yo), ^ 0 ( 2 : 0 , yo)), (^o(a:o, yi), vo(a:o, yo))}. 

In Jacobian notation, more simply, 

(10.5e) J{P) -> {Jo(Po),^o(P(5)}, 

(lO.Sf) J(P') {Jo(Po),-J^(P(5)}. 

Thus the jacobians of H and Hq have a one-to-two correspondence that the curves could 
not have (recall ^6). 



11. Proof of the main identity. The steps follow the pattern of §7 (with s = so), so 
a brief sketch might suffice. From ( 10 . 1 a), (with /c as yet unknown), 

( 11 . 1 a) fzdxo{xQ — s^)/xl = dx{x^ — s^)/x^. 

Together with the internal substitution of ( 10 . 1 a) into ( 10 . 1 c), this gives us 

( 11 . 1 b) left hand ( 10 . Sab) = dx(x s ) _ ^ 

^ ^ ^ ^ y/l^Xy/x{x-l){x-S^) ^ 



( 11 . Ic) A = 



Xo 



(xo 4- 5)«v/(a:o - 5 i)(xq - 62 ) ’ 



A' = 



Xo' 



(xo' 4 - 5)v^(xo' - 6i)(xo' - 62) 



What remains is a smaller computer algebra exercise than in §7 (fewer parameters): We 
compute [AA'Y and 4 - A’^ as rational functions of x using ( 10 . 1 a), e.g., 
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(11.2a) XqXo' = 5^, Xo-fXo' = 1 -h <s^ -f (x — l)(x - 5^)/(x/c). 

As in §7, AA' is rational when (xq — bi){xo' — b\) is a perfect square. The discriminant 
condition (as in (7.4cd)) gives the value of k in (10.1b). We have two values ±AA'y so we 
are simultaneously computing both of (10.3ab), 

(n.2b) A±A' = s/a^+A''^ ±2AA'. 



The radicand becomes a perfect square again, and this gives the differential shown in 
(I0.3ab). 

Finally, (10.3cd) follows from the substitution s — ► — s. 



12. Self-mapping of jacobians. The mapping of jacobians (10.5) from Ho(xQ^yo) 
to H(x,y) can be recast as a mapping from H{x,y) to itself (actually H{z,w)) by the 
biunique change of variables 



(12.1a) 

(12.1b) 



Xo = {z- l)s^/(z - s*), 

yo = tu(s + l)(s - l)^5®/(z - 5*)®7o. 



so the equation (10. Id) for Hq becomes that of H again, i.e., 

(12.1c) H: = z{z — l){z — s^){z ~ ai){z — a 2 ). 

The transformation (12.1a) comes from (10. la) with x and z symmetrically disposed; 
(x-l)(x-s^) {z-l)(z-s^) 



(12.1d) 



k(s^ - 1 )^ 



Here, k comes from (10.1b) and 
(12.1e) 



'll 



((,2-32)(, + ,2)2(,-1) 



(r -f- rs + 6^ — s)(r — rs + -f 5)(s + 1)5^ * 

The abelian differentials on the left in (10.3a-d) can be transformed back to H{z,w) by 
means of 



(12.2a) dxo(xo - s)lyo = ^odz{z + s)/w. 

At the same time the differentials on the right can be transformed to the “ standard” ones 
in (10.4a) by 

(12.2b) (x-ai) = (x -i- s)a -f (x - s)/?, (x - U 2 ) = (x - s)a* + (x + s)/?*, 

(12.2c) a = (5-ai)/2s, 0 = (5 + ai)/2s, 

(with the operator * denoting the involution s — ► —s). 

The transformation (10. Sab) of jacobians (as column vectors) now becomes 

(12.3a) J{z,m) + J(z',w') = MJ(x,y), 
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(12.3b) 

Here M is the matrix 

(12.3c) 

with 

(12.3d) 



J{z,w) + J{z\-w') = MJ(x',y'). 




(s + l)(r + rs + 5* — s)* 
(s-l)r(r-l)(r + s^) ’ 



(12.3e) 



77 = 



(r 4- T5 4- 5^ — 5)(r — r6 4- 5^ 4- 5 ) 
r(r- l)(r 4 - 52 ) 

It can then be seen (using the choice of signs from (12.3de)), that M had determinant 
—1 and trace zero. Hence its eigenvalues are ±1. 

We now choose new jacobian coordinates to follow the eigenvectors, (say) 

(12.4a) j{z,w) = {J dz{eiiz + ei 2 )/w, j dz{e 2 iz + sa) I , 

and likewise (with the same eigenvectors) for y(x,y). 

With this choice of jacobian (12. Sab) becomes 

(12.4c) j[z,w) + j{z',w') = Tj{x,y), 

(12.4d) j{z,w) + j{z',-w') = Tj(x',y'), 



As before, (12.4c) and (12.4d) are equivalent (only one need be used). 

The involution T and the correspondence of {z,w) with {z\w') justify the kernel Z/2Z x 
Z/2Z in the isogeny of the jacobian j{z,w). 



APPENDIX: THE HILBERT MODULAR INVARIANTS 
13. Hecke’s parameters. The period matrix R in (5.1c) must satisfy the sign condi- 
tion 



(13.1a) - i^p'y > 0, 

when the periods are those of abelian integrals. A period matrix with such sign conditions 
is called a Rtemann matrix. We had the further restriction to produce real mutiplication 

(13.1b) p = 2p". 

Hecke [7] introduced new variables 

(13.1c) (p/2 =) p" = (t - r')/2\/2, p' = -(r + t')/2, 

for which the sign condition (13.1a) is satisfied by 
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(13. Id) Sr > 0 > Sr^ 

In (5.1c) we now replace R hy SR^ (where the matrix S denotes a linear combination of 
the abelian integrals), 

( 13 . 1 .) 

Then the Riemann matrix R is rewritten as 

^ " (l -r')- 

There are (again) four periods shown as column vectors for the two abelian integrals 
represented by the rows. There is an obvious equivalence under change of variables through 
multiplication by GL2(C) on the left and unimodular change of period bzisis through 
multiplication by GL2(Z) on the right, i.e.. 



(13.2b) GL2(C) •/2*SL4(Z). 

The real multiplier ring isomorphic to Z(\/2) is put in evidence (again) by the real action 
on the variables: 

(«-) (f « = 

The right-hand matrix columns clearly show only a subgroup of the periods of (13.2b) of 
index 4, i.e., of structure Z/2Z x Z/2Z as in (5. 2d). 



14. The Hilbert modular group. Generically, this is the group G acting on the 
pair (r, r') which preserves the equivalence classes of the Riemann matrices with real 
multiplication. 

In our context, first there is a symmetric subgroup G* of index 2, 



(14.1a) 



ar + j3 , a'r' 4- /?' 
qfr + ^ ^ 7'r' 8 ' * 



Here a,/?,7,^ lie in Z(v^), and a',/?',7', <5' are the conjugates under y/2 
determinants also must satisfy 



-y/2. The 



(14.1b) aS — P'y = A, a'S' — /3'^' = A', 

for A a totally positive unit, actually n = (1 -f \/2)^"', (m G Z). (In this case, it suffices to 
take A = A' = 1 by using a factor of on a,/?, 7,^, - likewise for the conjugates.) 

The full Hilbert modular group has the alternating operation 

(14.1c) A: T -r', r' -r. 



So G = < G* ,A >. This group preserves the half-planes 9'r > 0 > ^r'. 
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To see the action on the Riemann matrices (cis in Hecke [7]), first let the element of C?* 
act so R —* G*(R). Then we remove the denominators by row multiplications, 



(14.2a) 

where, 

(14.2b) 



» _ f + ^) -^2(^7 + ^) \/2(ar + 0) 

V(7V- 



— (ar 
-{a‘ 



OIT +0) \ 
’r' + 0')J • 



' + £') v5(7V + 6') -v/2(oV + /3') 

Using integral components {ao,*** ,di) such that 

(14. 2c) o: = ao + ax\/2? P = bQ + biy/2, 7 = + Ciy/2,6 = do diy/2, 

we can rewrite R* in terms of rational integers, 



(14.2d) 



R* = R • A/, where M = 



do 


-2di 


to 


-di 


do 


-bo 


Cl 


— Co 


CLq 


-Co 


2cj 


— 2a\ 



Since it can be verified that 

(14. 2e) determinant (M) = (a<5 - /?7)(a'<J' - /?V) = 1» 

we see the substitutions of G* preserves the equivalence class of R. 

It is a trivial matter to see that the alternating substitution 



(14.3) 



A : 



— r , T 



-T, v/2 -► -\/2 



simply interchanges the rows of R. So the Hilbert modular group G =< G*^A > is a 
group preserving the equivalence classes of R, 



15. The modular function field. The Hilbert modular function field for Q(\/2) is 
the analogue of Q(i(r)) for the Klein modular group acting on r. 

We start with the symmetric subfield Q(x,y) (see [6]). There 

(15.1a) = HIIH 4 , (y=)y{T,T') = 



and Hk denotes special modular forms of (even) weight A;, e.g., 

(15.1b) G*: HkiiaT + 0)/{'yT + 6)) = {{■^T + 6]{'r'r' + S')}'‘H{Ty). 

The symmetry is derived from 

(15.1c) A: Hk(T,T') = 



The selection of the forms Hk of weight k is made so as to have simplest infinite behavior. 
In effect, we have the following diagonals: 

T>i : t{1 + V 2 ) = 7-'(l - \/2) = wi, 

D2 : r = —t' = 0)2. 



(15.2a) 

(15.2b) 
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Then H 2 ,H 4 ,Hq are chosen uniquely as the modular forms of indicated weight for which 
the diagonal behavior is like the Klein- Weber modular function ^((^2) on D2 and like the 
Hecke V^-modular function on Di. (Recall J2(t^i) = j2(-l/^i) = ^2(^1 + 

Thus (see [6] or [4]) 

(15.2c) D, : 1/x = Hi/Hl = 32 (^ 11 ), = 0, 

(15.2d) Dj : l/(zy) = H^/Hl = j(i^ 2 ), -^4(^2) s 0. 

The field Q(x, y) is characterized by the symmetry (of r — r',r' — r). The extended 

or alternating field is Q(x,y,a) with a determined by choice of sign in 

(15.3a) (a^=)a(i,y)^ = i(y + 4)(x=*y - 1728x - 288xy - 102V + 4xy^), 

according to work of Gundlach [6] and Nagaoka [16]. Here 

(15.3b) a(x(r,r')>y(’',r')) = -o(x(-t-', - r),y(-r', -’■))• 

(The factor of ^ will be needed to avoid irrationalities later on). 

16. Explicit twQ-isogenies. 

Three 2-isogenies come from r — + (2-h\/2)7-, r' — ► {2 — ^/ 2 )T', (note the multipliers 2±\/2 
have norm 2 and preserve the signs). Actually, the three mappings arise concurrently by 
coset properties in the Hilbert modular group 

r ((2 + V2)r, (2 - V2)t')) 

(16.1) (r,r') ^ ] (7-/(2 + V^),r'/(2-v/2)) (x, y, a)‘ (X, K, A). 

l((r + l)/(2 + v/2),(r' + l)/(2-v/2)) 

This is analogous to the triple t — > {2r, r/2,(l -H t)/ 2 } for the ordinary (Klein) modular 
group, which produces three conjugates y(r) — > {y(2r),y(r/2),y((l + r)/2)}. 

For simplicity, we begin by ignoring the mapping a — > A, leaving only (x,y) — ► (AT, F) 
(see Cohn [4]). We then start with a two- valued correspondence (to be resolved later by 
the choice of sign of a). 

Given (x,y) the symmetrized (X, F) are determined by an incomplete intersection with 
four cylindrical equations of co-dimension 2, (note the degrees of the variables): 

(16.2a) 

/i(a;,y;X)= + (432 + 156y - xy)X^x + a:(xy + 144y - 1728)^ 

-h(4x^y2-f 207x^y+1152y2x -f- 19008yx + 62208x4- 82944y^)X = 0, 

(16.2b) 

(/2 =)/i(X,F;x) = 0, 

(16.2c) 

/3(^,y;F) = FV(t/ + 4) - (x - 4y + 48)y^F2 -f (-5y -f 108)xyF 
-f x(xy + 144y - 1728) = 0, 
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(16.2d) 

(/ 4 =) MX,Y;y) = 0. 



The symmetry of (x,y) (X^Y) is reminiscent of the symmetry of x = j(cj) and X = 

j(2uf) in ^2(^9 X) = 0, the (cubic) Klein modular equation of order 2. 

17 . Uniqueness problem. As we see from the three 2-isogenies, for every (x,y) there 
should be only three pairings of (XyV) with (x,y) (not nine from the determination of 
three values of X from /i = 0 and three values of Y independently from fs =0). The 
equations /2 = 0 or /4 = 0 (generically) resolve the three legitimate pairings (XjY). 

If we take discriminants of the cubics fi and /a separately, 

(17.1a) AT-disc(/i) = 4xy^(x^y — 1728x — 288xy — 1024y^ -f 4xy^) 

• (3375x^ - 746496y + 10368xy -h 252x^y + x^y)^, 

(17.1b) y-disc(/3) = 4(x^y - 1728x - 288xy - 1024y^ -j- 4xy^) 

• (432 - 72y -f- xy -f Zy^^y^x. 

The fields Q(x,y,AT) and Q(x,y, y) are the same as we show next, but for now we may 
note the discriminants agree to within square factors! 

18 . Algorithm for identifying field generators. We can explicitly link Y, the root 
of /a, with Xy the root of fi by a construction using the following: 

18.1 Main Algorithm. Let Kjk be an extension field of degree n defined by ( where 
(18.1a) (A(e) =) + +--- + ^o =0,(yl, e/b). 



Let some other element rj be defined by the equation 

( 18 . 1 b) =) BnV" + + • • • + Bo = 0, (B, e k). 

It is required to discover if q lies in K, and if so to express q in terms of ^ as a polynomial 
of degree n ~ 1, 

(18.1c) r, = C„_ir~‘+--- + Co (=C(0), {Cjek). 

We look for the n — 1 unknown coefficients Cj. Note the converse process, finding B[q) 
given A(f) and C(f), is taken care of by 

(18. Id) ^[“n) = ^-resultant of A(f) and q — C({). 

So even with C(() unknown, the ^-resultant of A(f) and q — C(() is a polynomial 
R(q) of degree n whose coefficients are to be proportional to B{q). By comparing the 
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coefficients of R{rj) and B{rj) we have n — 1 equations Ti = 0 for t = 1 , • • • ,n — 1 (in the 
n — 1 unknowns Cj and the knowns Aj, Bj). Here the T{ are of degree i in the unknowns 
Ci- 

Then the unknowns Cj are found by the use of resultants for successive elimination. 

This process is tolerably easy where n = 3. We restrict further remarks to this case for 
simplicity. For instance, T\ is linear, so Cq can be solved in terms of Ci and C 2 . Then T 2 
is quadratic and Tz is cubic in the unknowns C\ and Cz- So the Ci-resultant is a sixth 
degree polynomial in C 2 , which we factor over k. In fact, (note the degrees), 

6699 22 236 12 

(18. le) C 7 i-resultant(r 2 , T 3 ) = R{^C 2 \AQ,A\^A 2 ’,AzyBo^Bi^B 2 tBz) = 0 (n = 3). 

If a solution (Co, Ci, C 2 ) does exist, there must be a linear factor giving C 2 . Going back 
we have a linear factor for C\ in Ti, and a formula for Cq, completing the solution. 

For efficiency, it is desirable to store the resultants and equations T,-. For instance, for 
n = 3, we would solve the linear T\ to eliminate (say) Cq in T 2 and T 3 . So we store Ti, 
T 2 , and the Ci -resultant of T 2 and Ts, (a polynomial of degree 6 in C 2 ). 

18.2 Extraneous Solutions. This process will always produce extraneous solutions. 
For instance T 2 is quadratic so it has two roots if it has one. Actually, in the case of a 
normal equation, each of the n roots of B{rj) are expressed by some C(^), but we still have 
n extraneous solutions. So verification of C{rf) is still necessary. 

18.3 MONIC Equations. If we consider the fact that the degrees of A 3 and B 3 are 

exceptionally high in the Ci-resolvent (even for n = 3), it is advantageous to change 
variables before executing the algorithm so as to make f and rj monic, (e.g., f — > and 

rj — ► BnTJt so An — > 1, J5n — > ij. In practice this helps avoid overflow. 

19. Improved parameters. With this algorithm (using k = Q(a:,t/)) we determine 
X e Q{x,y,Y) = K from fz(= A(K)) and fi{= B(X)) as 

(19.1a) 

(X=) X(V;x,i/) = (-(ya:- 1728 + 144y)(y^ + 3y-36)i]/dx 

+ y[(i^y + 21601 - 360yi - 20736y - 81xy^ + 3456y^ + 4y®i - 144y®)y]/dx 
+ r *[(yx + 3i + 432 - 36y) (y + 4)y=*]/<ix, 

(19.1b) 

dx = (yx + 432 - 72y + 3y^)x. 

At the expense of much more complicated relations, we now use the single- valued nota- 
tion, in which the extended modular function field is birationcdly parametrized as 



(19.2a) Q(x,y,o) = Q(s,y) 

by introducing the new parameter 




127 



(19.2b) 



(s =) s{a,x,y) 



g + 4(y + 4)(35y+ 108) 
X + 200y -h 864 



Conversely, x = Xg{s,y) and a = a 5 (s,i/) where 



(19.3a) 

(19.3b) 



16(25y + 108)6^ - 16(y + 4)(35y + 108)s + 4(y + 4)(49y^ -f- 288y + 432) 

y(y + 4)-2s2 

-8(y -f 4)(35y -f 108)s^ + 36(lly + 12)(y + 4)^s - 4y(y + 4)^(35y + 108) 

y(y H- 4) - 2 s^ 



The derivation (see Muller [15]) comes from the interpretation of the equation (15.3a) 
for as a conic in a, x over Q(y). Then by the “method of Pythagoras,” all we need for 
parametrization is a rational point in Q(y), namely 



(19.4a) ao = -4(y + 4)(35y + 108), xq = -200y - 864. 



Then the slope is the new parameter 



(19.4b) s = (a - ao)/{x - xo). 

Now, the multiplication on (r, r') by (2 -f \/2,2 — y/2) (and conjugates) leads to a three- 
valued mapping of 

(19.4c) (x,y,a,5)‘ (X,y,^,5). 

In terms of the one-valued parametrization we want just the mapping 



(19.4d) 



(s,j/)‘ (5,y). 



We pay for the “better” parametrization by more complicated relations. The cubic /s 
which defines Y now becomes 

(19.5a) f3,{Y-,i,y) = + Y2Y'^ + Y^Y + Yo = 0, 

with the more formidable coefficients 

(19.5b) 

^3 = y^(y + 4)(y^ -I- 4y - 2s^)^, 
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(19.5c) 

Yi = -8y*(y + 4)(y* + 4y - 2s^)(24y^ + 150y + 216 - 70sy - 216s + 51s=*), 
(19.5d) 

y, = -4y(5y - 108)(y^ + 4y - 2s^) 

(49y® - 140sy* + 484y^ + 1584y - 992sy + lOOs^y - 1728s + 1728 + 432s^), 

(I9.5e) 

Yo = 16(49y^ + 520y® - 140y®s + 1296y* + lOOy^s* - 992sy^ - 1728sy 
+ 360s*y + 864s^)(49y® - 140sy^ + 484y* + 1584y - 992sy 
+ 100s*y - 1728s + 1728 + 432s^). 



Note that the equation for X{Y',x,y) can now be rewritten by substitution 2 is X,{Y-,s,y), 
with similar loss of simplicity. 

Given s and y, for each of these three (generic) values of Y there must be one 5(s, y; y), 
so we have the three-valued correspondence (s,y)* {S,Y) precisely formulated (gener- 

ically). This is a much larger computation. The “total” equations are too long to be 
manageable (or even instructive), but we outline the component steps. 

First, we define p as the ratio A/a = a{X, Y)/a{x, y), i.e.. 



(19.6a) 



(y + 4)(X^Y - 1728X - 288XY - 1024y^ + 4XY^) 
(y -h 4){x^y - 1728a: - 288iy - 1024y^ + 4iy2) 



If we substitute X = X(Y\x,y) then the y-resultant of the above equation with fa yields 
a cubic in p^. This equation factors into two cubics (trivially equivalent under p *-> — p): 

(19.6b) 

0 = po + PiP + paP^ + PsP^, where 
(19.6c) 

Po = (xy- 1728-|-144y)(i*y^-h2i^y-904iy*-9920a:y- 16000x-|- 165888y^), 
(I9.6d) 

Pi = y(-746496y^ + 2y®x® - 1055y®i^ -|- 9289728y^ - 7741440y®, 

- 4032iy‘* -I- 8x^y* - 14008i*y^ - 3365632xy^ - 213184y®i 

- 90408961 - 23280x^y - 8294400x + 4i®y^), 

(I9.6d) 

Pa = — y^(— 2xy + 480 -t- 464y + 54y^ — xy^)(i^y — 1728x — 288iy — 1024y^ -|- 4iy^), 
(19.6e) 

P 3 = y®(y + 4)^(x^y- 1728x- 288xy- 1024y^ -f 4xy^). 

We note (with some relief!) that no radicals (like y/2) were needed for factorization leading 
to this equation! 
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By applying the Main Algorithm again, we express p directly in Q(F, i,J/) as 



(19.7a) 

(p =) p{Y-,x,y) = [(yx - 1728 + 144y)(4y®x - 576y® + i^y^ - 174iy^ 

+ 6912y* + i^y + 576yx + 4320i))/dp 

- r|2y(-1488y'‘ + 8xy^ + 2y®x* - 643y®i + 38976y® - 292608y* 

- 51i=*y* + 8252xy* + 470016y - 108i^y - 11664yi - 139968i)]/cip 

- y*[y*(x^y* + 2x^y - 432x - 235008 - 760yi - 12672y - 211iy* 

+ 2400y* + 4y^i - 744y®)]/dp, 

(19.7b) 

dp =y(x*y - 1728x - 288yx - 1024y^ + 4iy^)(yx + 432 - 72y + 3y*). 



The s,y parameters can now be made beisic. Since p £ Q(y, i,y) C Q(s,Y,y), the same 
can now be said for A = ap and finally for 



(19.8a) 



(S =) 5(K;s,y) 



A + 4(y + 4)(3sy + 108) 
X + 200Y + 864 



(In fact we use a = a,(s,y),X = X(Y;x,y) and i = Xa(s,y) to obtain S as the ratio of 
two quadratics in Y with coefficients in Q(s,y]). 

The “final” result, by another use of the algorithm, is 



(19.8b) 5 = So(s,y) + y5j(s,y)+y^52(s,y), 



which would, unfortunately, involve unmanageable polynomials of Q(s,y) in the rational 
coefficients Si{s,y). We content ourselves with the implicit result. 



20. Fixed points as illustrations. The fixed points of the system (16.2a-d), where 
(x,y) = (X^Y) (see Cohn [4]), lead to the following table: 



X = X 


II 


a = A 


A/a (= p) 


576 


12 


+3072 


1 


V 


n 


-3072 


1 


117+\/l7 

2 


-3-3^/^7 

2 


^(l±^)16(4_v/i7)9 


1 


n 


» 




1 


117-n/I7 

2 


-3+3n/T7 

2 


^(6^)16(4^^)9 


1 


n 


n 




1 


-64 


-4 


0 


-1 



s = S 
48/5 
8 

8-24v/r7-H9N/2 + \/34 
20 

8-24^/^7-9^/2-^/34 

20 

8-f24N/l7+9x/2-N/34 

20 

8 + 24\/r7-9\/2 + N/^ 
20 



There are generally two values for s{= S) corresponding to the two roots ±a(x,y). Note 
the role of conjugates in Q(\/2, \/TT). Also note in the last case, that the locus a = A = 0 
can be a point of indeterminacy for some parametrizations. 
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21. Remark on algebraic invariants. Something sorely lacking in this exposition 
is explicit algebraic expressions for the Hilbert modular invariants, (e.g., x,y,a, 5, etc.,) 
in terms of coefficients of the hyperelliptic equation, analogously with classical elliptic 
formulas for j, such as 



(21.1a) 






1728 • 

4^13 + 27 ^ 2 ' 



Thus j is seen directly as an elliptic curve parameter, e.g., 

— 27 ? 

(21.1b) B = ^ arbitrary). 

Armand Brumer has suggested that corresponding relations should be known from spe- 
cialization of the Siegel modular functions, but it would be helpful to have explicit results 
in palatable form. 




131 



REFERENCES 

1. J.-B. Bost and J.-F. Mestre, Moyenne arithmetico-geometrique et periodes des courbes 
de genre 1 et 2, Gazette des Mathematiciens (Soc. Math, de France) (1988), 36-64. 

. 2. A. Brumer, (Unpublished communication, March 1997), 

3. J.W.S. Cassels and E.V. Flynn, “Prolegomena to a Middlebrow Arithmetic of Curves 
of Genus 2,” Cambridge Univ. Press, 1996. 

4. H. Cohn, An explicit modular equation in two variables and HilberUs twelfth problem. 
Math, of Comput. 38 (1982), 227-236. 

5. H. Cohn, Introductory remarks on complex multiplication, Internal. Journ. of Math, 
and Math. Sci. 5 (1982), 675-690. 

6. K.B. Gundlach, Die Bestimmung der Funktionen zu einigen Hilbertschen Modulgrup- 
pen, Journ. fur die reine und angewandte Math. 220 (1965), 109-153. 

7. E. Hecke, Hohere Modulfunktionen und ihre Anwendung an der Zahlentheorie, Math. 
Annalen 71 (1912), 1-37. 

8. F. Hirzebruch, Hilbert modular surfaces, Enseignement Mathematique 19 (1973), 
183-281. 

9. E. Howe, Constructing distinct curves with isomorphic Jacobians, Journ. of Number 
Theory 56 (1996), 381-390. 

10. G. Humbert, Sur les fonctions abeliennes singulieres /, Journ. de Math. (5) 5 (1899), 
233-350. 

11. J. Igusa, Arithmetic variety of moduli for genus two. Annals of Math. 72 (1960), 
612-649. 

12. L. Konigsberger, Uber die Transformation der Abelschen Functionen erster Ordnung, 
Journ. fur die reine und angewandte Math. 64 (1865), 17-42. 

13. J.-F. Mestre, Families de courbes hyperelliptiques a multiplications reelles,. Arithmetic 
algebraic geometry (1989), 193-208; Prog. Math., 89, Birkhauser, Boston 1991. 

14. J.-F. Mestre, Courbes hyperelliptiques a multiplications reelles, Seminaire de Theorie 
des Nombres, Univ. Bordeaux I, Talence (1987-1988), Exp. 34, 6 pp.. 

15. R. Muller, Hilbertsche Modulformen und Modulfunktionen zu Q(\/8), Math. Annalen 
266 (1983), 83-103. 

16. S. Nagaoka, On Hilbert modular forms, Proc. Japan Acad. Ser. A 59 (1983), 346-348. 

17. B. Poonen, Computational aspects of curves of genus at least 2, Algorithmic Number 
Theory (ANTS II, H. Cohen, ed.) (1996), 283-306; Second International Symposium, 
Talence, France; Springer- Verlag (Lecture Notes in Computer Science 1122). 

18. H.L. Resnikoff, Singular Kummer surfaces and Hilbert modular forms, Rice Univ. 
Studies 59 (1973), 109-121. 

19. K. Rohn, Transformation der hyper elliptischen Functionen p = 2 und ihre Bedeutung 
fur die Kummersche Fldche, Math. Annalen 15 (1879), 315-354. 



IDA Center for Computing Sciences 
Bowie, MD 20715-4300 
email: hcohn@super.org 




HUMBERT’S CONIC MODEL AND THE RUMMER SURFACE 

Harvey Cohn 



Abstract. In his Cours d'Analyse in 1904, Georges Humbert used the parametrisation of 
a pencil of conics through four points by the Weierstrass p-function to prove theorems of 
geometry and mechanics. This method is implicit in his earlier applications of Kummer 
surfaces, for instance his criterion for real multiplication by \/2 uses the special "quarter- 
period" configuration in the pencil. 

0. Introduction. This talk is intended to relate two topics found in work of Georges 
Humbert (1859-1921). The first is an esisy but unconventional parametrization of the conic 
(Part I), and the second is a conventional parametrization of the Kummer surface (Part 
II). The interrelation of these two topics is thinly sketched for the special goal of real 
multiplication by (Part III). All this is over the field C. 

Some of the best work on the parametrization of the Kummer surface was done by 
Humbert [8] in 1895. It represented the highest state of art in the use of theta-functions 
(which has survived intact) and the algebraic geometry of surfaces (which has transformed 
itself almost beyond recognition). Indeed, more modern treatments (see [3]) tend to use 
purely algebraic tools altogether, rejecting theta-functions. Some very readable accounts 
of hyperelliptic (genus two) Jacobian manifolds are given in [14] and [18], and valuable 
bibliographies appear in [3] and [16]. 

After his advanced work, Humbert published an elliptic parametrization of the family 
of conics through four given points [lO]. This tool has remained largely neglected despite 
the combination of its simplistic appeal (see [4], [13]) and relevance to the more prized 
real multiplication (see [2], [19]). 

PART I. ELLIPTICAL PARAMETERS ON A CONIC PENCIL 

1. Two theorems. Humbert [10] displayed two enticing results: 

1.1 Euler. Given two circles in the plane, one containing the other. It is generally 
impossible to construct a triangle which is inscribed in the larger and circumscribes the 
smaller, but when this is possible there is a continuum of such triangles. 

The obvious trivial Ccise is where the circles are concentric. Only when the ratio of radii 
is \/S will there be such a triangle, and of course the configuration rotates. 

While Euler simply counted parameters, Poncelet constructed the solution by involutions 
of point sets. Jacobi used the elliptic parametrization we shall discuss in the next section. 
The next theorem provides his special application. 

1.2 Jacobi. Consider a weight following a circular pendulum with position P{t) in the 
plane of the pendulum at time t. Then for any (real) constant z, the chord connecting P{z) 
and P{t + z) will have as envelope a circular arc joining the high points of the pendulum 
path. 
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2. Parametrizations of a conic. For simplicity we agree that we are in the field C 
and conics are understood to be nondegererate, (not consisting of lines). 

In the projective plane, to begin with, all conics are the same and all possible birational 
parametrizations are equivalent (affinely) to the simplest imaginable, 

(2.1a) X = t, y = . 

Actually, the most general parametrization is 
(2.1b) 

y :x:l 

— -h CLl2^ “f" <^13) ’ (<^21^^ “h 022^ “h <^23) • (<^31^^ ^32^ + ^33)» 

with matrix form (“^r” for transpose), 

(2.1c) (S/,I,1)‘''= M (^^t,l), M = (a.y), det Af 0 

It is clear that if Af~* acts on both sides, we get the new coordinates for (2.1a), 

(2.1d) (y', *',!)*’■ = Af-‘ (y,x,l) = (t",t,l). 

Now by contrast, we create a unirational (two-to-one) parametrization of an arbitrary 
conic section. We consider a (complex) elliptic curve in Weierstrass form, 

(2.2a) j/^ = 4(x-ei)(z-e2)(a:-€3), (z = p(u), y = p'(u)), 

(2.2b) e, 7^ ey(« ^ j); 61 + 62 + 63 = 0. 

For uniform notation, we write the parameters 

(2.2c) {to,h,t 2 ,t 3 } = {00,61,62,63}. 

We recall the periods and values at the half-periods are 
(2.2d) {2iyi,2t/;2,2iy3(= 2iy2 4- 2u;i)}, 

(2.2e) p(wj) = 6y, p'[wj) = 0, (y= 1,2,3). 

The period lattice is 

(2.2f) n = {2niiyi 4- 2fi2ti;2}i (ni,n2 € Z). 

Remembering that t is the birational parameter of the conic (2.1b) in the (projective 
complex) xj^plane, we replace the parameter by u (writing “[u]” symbolically for the point 
(x,j/)) as follows: 

(2.3a) [u] : (z,y) = (x(i),y(f)) for t = p{u). 

Since go(u) = jp(— u), we have the two-to-one correspondence 

(2.3b) [u] (x,y), [-u] -► (x,y); (x,y) {[u],(-u]>. 

If two symbols [u] and [z] were “added” we would obtain only two chords from [u] to 
[±u± z]y namely 

(2.3c) {chord ([ti][u 4- 2^]), chord ([w][i^ — 2:])}. 

(The term “chord” will also refer to the corresponding secant). 

Above all, the values of [u] are periodic (2.2c) which makes for even more interesting 
results. Hence the chords in (2.3c) are distinct unless u or 2: is a half-period, (2u or 2 z lies 
in n). 




135 



3. Pencils of conics. 

3.1 Main Theorem. Construction of the pencil of conics through four 

POINTS OF A GIVEN CONIC. Given a conic C(0) with four designated base points in 
parameter t = bet us introduce an elliptic function t = fp(u) such that those 

four points correspond (uniquely) tou = 0 ,wijW 2 ,vj 3 (with the periods as described earlier 
in (2. 2d)), Then each point of C{0) is described generically as [u] and [— u]. 

Let z be SL fixed parameter and draw the chords (secants) connecting [u] with [u + js] and 
connecting [u] with [u — z\, assuming z is not a half-period. Then the chords vary with u 
sweeping out a conic C(z) as their envelope with points of contact P and —P (see Figure 
1 ). This is the most general conic of the pencil. 

Finally^ for z a half-period, wi,W 2 ,W 3 , C{z) is one of three limiting degenerate conics, 
namely the diagonals of the complete quadrangle determined by the base points. 

The essence of the construction is that the envelope of the chords is a curve from 
which we can draw two tangents from an external point, hence a conic C{z). When 
u G {0,u;i,u;2,iy3}, then there is only one tangent, which corresponds to the four base 
points (of intersection of C(0) with C{z)). 

Let us next consider the three degenerate conics of the pencil, C(iy») with intersecting 
lines at A{ (as in Figure 1). Here, A{ is the intersection of the chords (of C7(iy»)) from [0] 
to [tz;,] with (cyclically) the chord from [itZj+i] to [1^1+2]. Also the points [u] and [u + iy»] 
are in involution. This involution is created from the requirement that the chord from [u] 
to [u -h W{] must go through A{, and indeed C[wi) is the “degenerate envelope” of these 
chords. 

According to this same involution, the lines from A» which are tangent to C(0), (not 
shown in Figure 1), must occur where ±u = u + mod Cl, i.e., at quarter-periods, viz., 
±u;i/2,±(iyt There are six of them, identified with the tangencies on C(0) from 

A\,A 2 and the imaginary tangencies from A3, (using the notation of Figure 1). 

3.2 Corollary. Any given pair of nontangent conics, (not equivalent to concentric 
circles), can be embedded in a pencil so as to be C(0) and C(z) for a proper choice of 
parameter z. 

The cases considered are those where the conics have four distinct points of intersection, 
which become the base points. Of course all points are projective-complex. 

4. Proofs for §1. For Theorem 1.1, we consider two circles (or any other conics indeed) 
as C(0) and C{z) of a pencil. Then a triangle inscribed in C(0) and circumscribing C{z) 
must have on C (0) the vertices 

(4.1a) S = {[u], (u + z], |u-z]}, 

and these must be the same if we start from u = u-f-2;oru = u — z. Easily, [u~\-2z] — [u — z], 
etc., so 

(4.1b) 3z = 0 mod Cl. 

Thus we can “rotate” the configuration by making u vary. 
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Figure 1 , Generic red case 

There is a similar result for an n-gon rather than a triajigle. It corresponds to n-points 
of division of the period structure 0. It would be dependent on the parity of n, and, worse, 
it should require a painful sorting out of reentrant polygons. We do not pursue this matter 
here (see [10]). 

Theorem 1.2 is again relegated to the literature [lO] with the remark that the spherical 
pendulum is parametrized by the g£>-function. 

5. The quarter-period configuratioii of the pencil. We consider the special case 
of a primitive quarter-period 

(5.1) Az e n, { 2 z ^ n). 

This is shown in Figure 2, where two base-points are real and two imaginary. 

Here, [u] creates its pair [u ± z] but [u -f 2z\ creates the same pair, (Remember Zz = 
—z mod n). If we look at the four points just mentioned, the chords produce the following 
tangencies at C{z): 
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chord ([u], [u z]) is tangent at “5” 
chord ([u], \u — z\) is tangent at “6” 
chord ([u -h 2z], [u z]) is tangent at “4” 
chord([u + 2z]j [u — z]) is tangent at “3.” 

As we saw earlier, since 2z is a half-period, the chords between [u], [u+2^] and [u-f «], 
go through A 2 (the center of the degenerate conic). 

Additionally, if u = z/2 or Zz/2^ one chord joins them and another becomes tangent to 
C(0). This produces two points of tangency of C{z), written as “1” and “2.” 

These six points of tangency on C[z) are a very valuable set. We first ask how many 
degrees of freedom are in the set. There is one module in the four base points on C'(O), 
essentially the y-invariant of the elliptic curve (2.2a). There is one other module, the value 
of u. Let Si be the (biunique) t-parameter of the tangent point “ 1 ” on C{z). Then Humbert 
[ 9 ] showed that the hyperelliptic curve 

(5.2a) = (x - si)(x - S 2 )(x - S 3 )(x - S 4 )(x - Ss)(x - se) 

has modules creating real multiplication by \/2, or otherwise expressed, modules trans- 
formed by the Hilbert modular function for (see [6]). 

In more convenient parameters (see [4]), the quintic (5.2a) becomes 

(5.2b) = x(x — l)(x — s^)(x — ai)(x — a 2 )f 



(5.2c) 



ai 



r - rs + + s _ r -h rs + s^ — s 

H- rs -f- s2 — 5 ’ - rs -h -f 5 * 



6. The pencil as a symmetrized period manifold. The periodic structure C/f2 
(the parallelogram) is the exact image of the elliptic curve (2.2a). Now C(z) is by definition 
periodic in z, but also it is symmetric, so 

(6.1a) C(z) = C(z + 2wi) = C(-z). 

So the usual parallogram is reduced by several identifications. Generally, from (6.1a), 
(6.1b) C(wi + z) = C(wi-z). 

So there are centers of symmetry at each of {0, it;i, it; 2 > 1 ^ 3 }- It is easy to see that under 
the symmetry the manifold for z namely (C/n)/{±u} is of genus 0. Correspondingly, the 
manifold of [u] can be the (birationally parametrizable) conic. 
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PART II. THE KUMMER SURFACE 

7. The symmetrized Jacobian. We make the generalization from elliptic functions 
(of one variable and two independent periods) to hyperdliptic functions of two variables 
with four periods. We restrict ourselves to period systems reducible to the columns of the 
so-called Riemann matrix 

(’•“) - (5 ? ; 0 ' 

with three complex parameters a, 6 , c whose imaginary parts form a positive definite matrix 
inside (7.1a), 

(7.1b) -\-2b^rj crj^) > 0 , for real (^, r;) ^ ( 0 , 0 ). 

Such period matrices come from the periods of hyperelliptic integrals (of genus 2 ). The 
quadruple periods form a lattice L in (with complex variables u and v), viz., 

(7.1c) Periods of (“) = ■i' = ( j) + (j) + ( 5 ) + (j)}i 

(7.1d) Gi, Gj, Hu H 2 e Z. 

The manifold of C^/L is a period parallelopiped in two complex dimensions. It is called 
a “Jacobian” manifold because of its original application (see [14]) where u and v are 
hyperelliptic abelian integrals of the first kind with the periods shown. 

Functions defined on it are quadruply periodic (meromorphic) functions, which may 
be represented as the quotients of special holomorphic functions. These are not quite 
quadruply periodic but have exponential periodicity factors of a special form. They are 
called normal theta-functions of order N (see [5]) and are defined as a linear space (over 
C) by the period relations 

(7.2a) ^at(u,v) = ^/v(u + l,t;) = ^^r(u, v -f- 1), 

61//(u,v) = exp 27 t[i(u + a/2)7V] ^i\^(u + a, V -h 6 ) 

(7.2b) = exp 27r[t(t; + c/2)7V] ^ 7 \^(u + 6 , V H- c). 

Special attention is given to even theta-functions defined by 

(7.2c) Bn[u,v) = Osi-u^-v). 

It can be shown, a priori, that the space of normal theta-functions is of dimension 
and the subspace of even functions is of dimension + 4 ) /2 when N is even. (Odd 
theta-functions are the complementary subspace; they have a (— 1 ) factor in (7.2c), but 
they are not needed here). 
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Analogously with elliptic functions, we say the jacobian manifold is C^/L. The sym- 
metrized jacobian manifold is (C^/L)/{±(u, v)}; it is the object that corresponds to a 
Kummer surface. 

8. Formulas for theta-functions. We give only the briefest survey, with minimal 
proofs. 

The best known example of a normal theta function is of order N = 1: 

— oo,+oo 

(8.1a) «(u,v) = exp 27ri[{mu + nv) + <^(m, n) /2] 

m,n 



(8.1b) <t>[m^n) = am^ + 26mn -f cn^. 

Trivially, 6{u^v)^ is of order N but more sophisticated examples are needed. 

We must construct 16 auxiliary theta-functions of (order one) but with characteristics 
defined by the quadruples: 

(8.2a) 



This introduces signs into the relations (7.2ab). We define 



(8.2b) 



© 



f 9i 92\ fu\ 

\hi h2j\vj 



— 00, + CX5 

^ exp 27Tt[( Y + m)u+( Y + n)v + 

m,n 



mhi + n/i 2 
2 






If the characteristics are all 0, then of course we are back to (8.1a). (It is possible to define 
such functions as in (8.2b) with order n, but they are not needed here, compare [14]). 

The modified period and symmetry relations are 

®(2‘. g)(:) 

(8.3b) 

«(*; K)(:) K)(:::) 

K" K)(:) = (-i)-— e(«; «)(::). 



(8.3c) 
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The squares of these functions are all normal even functions of order 2. 

The important fact about these 16 functions is that they have as roots the 16 half-periods 
in some combination. From (7.1c) these 16 half-periods are A/2,^/2 where 

(8.4a) A = Gi -}- Hia *+- H 2 ^i M ” ^2 

where, from now on, 



(8.4b) GuHuG2,H2 €{0,1}. 

By combining all the identities (8.3abc), we find 

£)(: 

« ttt- / ^ ^ + ^2^2 “I" "I” ^2-^2 

X exp 27 ri[JETi(u + + -) + H2{v -H <^2 + ~) H ^ 

We look for roots of the theta-functions among the half-periods (uq, vo) with 
(8.4d) uo = — A/2, Vo = — m/2. 



By substituting these values in (8.4d) and using symmetry (8.3c), we find 



(8.4e) 







with multiplier reminiscent of (8.3c), 

(8.4f) € = exp 7Ti[(^i + /Ti)(/ii + C7i) + (^2 + ^2)(^2 4- C2)]. 



Of course both sides of (8.4e) vanish if e = — 1. So for each of the sixteen characteristics 
there corresponds of six roots (i.e., half-periods): 

(8.5a) [g\, 92 ,hi,h 2 ) (Gi, G2, ifi, 1^2) (one-to-six) 

for which e = — 1. Actually, the six values follow modulo 2 from the columns: 

0 10 111 
10 10 11 
0 1110 1’ 

10 1110 

We later verify that half-periods may occur as roots only from the above table. This is 
not a trivial observation, since a function of two variables has as its roots a submanifold 
of dimension one. 

It is convenient to use the dyadic representation for the characteristics of each theta- 
function 



(8.5b) 



91 H- Hi 

92 + H 2 
hi -h Gi 

/12 4 “ G2 
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(8.5c) j = Sgi + 4^2 "I* 2hi H- / 12 , (0 ^ J ^ 15), 

and for each of the half-periods 

(8.5d) J = 8^1 +4fT2 + 2Gi + C2, (0< J<15). 

So from (8.5b) each j engenders six values of J, denoted for later convenience by the set 
Cy: 

(8.5e) Cj = {J} : j J, (one-to-six). 

This relation is shown in the columns showing six values of J for each Cji 

8.6 The Rummer table of theta- functions and roots. 



Co 


Cl 


C 2 


C 3 


C 4 


Cs 


Ce 


C 7 


Cs 


C 9 


Cio 


Cii 


C 12 


Ci3 


Ci4 


Cis 


5 


4 


7 


6 


1 


0 


3 


2 


13 


12 


15 


14 


9 


8 


11 


10 


10 


11 


8 


9 


14 


15 


12 


13 


2 


3 


0 


1 


6 


7 


4 


5 


7 


6 


5 


4 


3 


2 


1 


0 


15 


14 


13 


12 


11 


10 


9 


8 


11 


10 


9 


8 


15 


14 


13 


12 


3 


2 


1 


0 


7 


6 


5 


4 


13 


12 


15 


14 


9 


8 


11 


10 


5 


4 


7 


6 


1 


0 


3 


2 


14 


15 


12 


13 


10 


11 


8 


9 


6 


7 


4 


5 


2 


3 


0 


1 



8.7 GOPEL QUADRUPLES OF THETA- FUNCTIONS. These are unordered sets of four 
distinct (ordered) quadruples of characteristics 

(8.7a) (r = l,--.4) 

such that (summing over r in each quadruple 

(8.7b) = = 

= = 0 mod 2. 

They have the property that sign factors of (8.3abc) cancel in the product of four theta- 
functions of these characteristics. (There are 60 such sets). 

Thus for the product of elements of a Gopel quadruple, (say) 

(8.7c) ^ap'^6 — 

is a normal even theta-function of order N = A. 

9. The Kummer surface. We start with a Gopel quadruple. Since there are only 
10 linearly independent even normal functions of order four, some relation exists with 
constant coefficients among the 11 indicated functions: 

e„0^0^es = 

a a,0 



(9.1a) 
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If we introduce the variables 

(9.1b) x = @l, y = Qp, z = ©^, w = ©I, 

then we square (9.1a) to obtain a homogeneous equation of degree four 

(9.1c) xyzw = P(x,y,z,iy)^, 

(with P(x,y,2, ly) the homogeneous form of degree two from (9.1a)). This equation rep- 
resents the Kummer surface, which is important because it is isomorphic to the jacohtan 
modulo even symmetry (u,v) — ► (— u,— v). This lost result is essentially an easy conse- 
quence of the degree four (as we shall later note) . 

Actually, the Kummer surface has simple singularities, with the neighborhood of a point 
looking like a cone (rather than a plane). This condition leads to the form (see [17]) 

P{x,y,Zyw) = [x^ + + z^ 

(9.1d) + 2/(y^ + xw) -h 2g{zx + yw) + 2h{xy + ziy)]const. 

The roots (8.4a) of the theta-functions, seen previously to be half-periods, are now 
also seen to be the singularities of the Kummer surface. Actually, this follows from the 
symmetry about the half-periods in (8.4d), 

(9.1e) /(u,i») = /(— u,— v) = /(u-HA,v+/i) => /(A/2-f-u, /i/2+v) = /(A/2— u, /i/2— v). 

This is more complicated than symmetry in one variable. It means that /(A/2 + u,/i/2-hv) 
has an expansion in terms of u^, and (additionally) uv. So a change of variable is not 
enough to remove the singularity (as in the case of a Riemann surface with a branch point). 

9.2 The Kummer Configuration. The sixteen singular points lie in sixteen conics 
(also denoted by Cj ) with six points to a conic as shown in the Kummer table. Each point 
lies on six conics. 

Each conic lies in a different plane, so that each pair of conics intersects in two singu- 
larities. This creates a correspondence of each of the 120 pairs of singular points to a pair 
of (intersecting) conics. 

Typical correspondences through intersections are shown here. Note from Kummer’s 
table the reciprocity of the relations 

(9.2a) Co n Cl {10, 11}, {0, 1} - Cio n Cii. 

Also, for any conic (say Co), each of its fifteen point-pairs (diagonals) will account for 
the intersections with the fifteen other conics. For instance, the point-pair {11,13} in Co 
accounts for Cq, i.e., {11, 13} — ► Co n Ce. 

The 60 Gopel (unordered) quadruples of characteristics (i.e., of theta-functions) become 
quadruples of conics {C,-, Cy, Cjt, C/} for which, (taken in any or every order). 



(9.2b) 



Ci = (C. ncy) u (c.-nCfc) u (c»nc,). 
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Thus {Co, Cl, Cio, Cii} is such an example: 

Con Cl = {10,11}, ConCio = {7,13>, CoHCh = {5,14}, 

(9.2c) CiHCio = {4,15}, CiHCii = {6,12}, CioHCn = {0,1}. 

Otherwise expressed, there are 6 singularities on each of the four conics, but in total there 
are only 12, as they each appear twice. (Compare [7] for further properties). 

A Gopel quadruple of conics (9.2b) has the equivalent property that the singularities 
labelled {t,y, A:, /} have no subset of three lying on one conic. 

10. Some typical proofs. The original theoretical development was highly intuitive 
from Riemann to Poincare [15] and even beyond. Much of the flavor of the subject was 
lost by modern standards of rigor (see [11]). 

10.1 Poincare. A theta-function of order N and one of order M have 2MN roots in 
common (or inGnitely many). 

As a limiting case in (7.1a), when 6 0, we have two periodic systems one for u 

of periods (l,a) and one for v of periods (l,c). The theta>functions break up into the 
product of two of the familiar elliptic type, and the simultaneous roots in the limiting caise 
might come from 

(10.1a) 

Each (or 0m) has N (resp. M) roots, so we make the choice: 

(10.1b) ^ 7 \J^(i^) = = 0 versus 0j^\v) = 0 m (u) = 0. 

With either choice we have MN common roots, hence the result. The counting formula 
must hold true for 6 0 by continuity. 

As a consequence, any of the theta-functions of order one in (8.2b) indexed by j has only 
six of the roots indexed by J as shown in Rummer’s table (no other half-periods). This is 
true because Rummer’s table is an exact accounting of (half-period) roots, i.e., two such 
functions have only two roots in common, while any two columns share only two roots. 

As a second consequence, the Rummer surface (9.1c) is an isomorphic image of the 
symmetrized Jacobian, (C^/L)/{±(u, v)}. If we make 

(10. Ic) xjw — const, y/w = const, 

then by (9.1b), this means 

(lO.ld) 0^ - (x/u>)0| = (y/ii>)0| = 0. 

Thus a pair of theta-functions of order two vanishes from (10.1c). Such a pair is satisfled 
only by eight values (u,v) or (by symmetry) four values ±(u, v). Of course the line (10.1c) 
must have exactly four intersections with the Rummer surface (of degree four). This proves 
the desired isomorphism. 
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PART III. ELLIPTIC CONFIGURATIONS ON THE KUMMER SURFACE 

11. Abelian functions and intermediate functions. The central object is the 
abelian function, a meromorphic function /(u, v) (of two complex variables) which has a 
quadruple set of periods, so it is defined on a manifold like C^/L in (7.1c). Specifically, 

(11.1a) (ai,/?i) e {(1,0), (0,1), (a, 6), (6, c)}, 

(11.1b) /(w,v) = /(u + ay,v+/?j), i = !,••• ,4. 

By a theorem of Appell and Poincare [l], such a meromorphic function is globally a quotient 
of holomorphic nonvanishing intermediate functions 0(u,v) with the property 

(11.1c) 6(u -h aj,v + J3j) = 0(u,v) exp(^jU -h SjV -h Cj), j — 1,- • • ,4 

with involved consistency conditions on the 7 y, (which come from the commutativity 

of the period addition, see[12]). 

This structure comes from hyperelliptic curves, like that of (5.2a) if we set 

u = / dx/y -f / dx/y, 

J oo J oo 

ri^',y‘) 

v= xdxfy / xdxjy, 

J oo J oo 

Then the paths on the Riemann surface of H define four independent periods (on say 
C^/L) and symmetric functions such as 

(11.2c) fi = xx\ f 2 = x + x', /a = y-hy', A = yy', 

are abelian functions. We can replace u,u by linear combinations of u,v and choose paths 
cleverly so that we return to the periods in 

‘ (I °i 1 1) ■ 

with three complex parameters a, 6, c (satisfying (7.1b). It has been shown [14] that 
/i»/ 2>/3 are generating functions of the field of abelian functions, while /i,/ 2,/4 (as well 
as x/w, y/w, zfw of (9.1b)) are generators of the subfield of even functions. (Note that the 
change y —y through a different path leaves f^ unchanged while (it,v) — > — (u,v)). 

Special intermediate functions are the theta-functions, which were used to construct 
the Rummer surface. Since an algebraic curve on the Rummer surface is rational in the 
homogeneous coordinates x,y,z,w of (9.1b), it follows that such a curve is determined by 
the zero locus of an intermediate function. 

12. Singular abelian manifolds. The abelian manifold C^/L is called singular when 
the lattice of periods L has a 2 X 2 nonscalar matrix (5 ^ EZ) as an endomorphism, called 
a “complex multiplication” 



(11.2a) 

(11.2b) 
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[3z/2] 



Figure 2, Quarter-period case 

(12.1a) SR = RM 

for M an integral 4x4 matrix. Here S signals a change of basis for (u, v) and M a subbasis 
for the lattice of periods R, 

Generically, (12.1a) is not possible if det(M) ^ ±1. This is similar to complex multipli- 
cation which is nongeneric for elliptic curves over C. In other words there must be a loss 
of free parameters in (11. 2d). For instance, we just take the case where a singular relation 
happens to be 

(12.1b) a = 2c. 

Then the relation (12.1a) becomes valid with 

-a o). -=(o A)- 



(12.1c) 



S 
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This is called a “real” case [9] of complex multiplication since the eigenvalues of S axe real 

(±\/ 2 ). 

To follow through on Humbert’s approach, we note that the intermediate functions 
^(u,v) of (1.1c) may have special consistency conditions on the exponential coefficients. 
Thus we define a singular intermediate function as one whose period relations (11.1c) hold 
only by virtue of a singular relation like (12.1b). An example of such period relations is 

(12.2a) 0{u,v) = ^(u + l,v) = ^(u,v + l), 

(12.2b) 

^(u-ha, v + 6) = ^(u, v) exp 47Ttt;, 6{u b,v c) = 0(u, v) exp 27rtu. 

Now the vanishing locus of a singular intermediate function is a singular algebraic curve 
on the Kummer surface. If everything is viewed from the perspective of one of the singu- 
larities (at the tangent cone) there will be six conics passing through the singularity, lying 
in six planes. These planes are viewed as six lines with 15 intersections (for the remaining 
15 singularities). Some singular curve now will project into a configuration with two conics 
and six lines. 

We shall describe the configuration and skip the details involved in its justification 
(which is unfortunately lengthier than the entire current presentation!) 

We look again at Figure 2. Here the six lines are those dashed lines, actually tangents 
to C{z) paissing through the points numbered 1,2, 3, 4, 5, 6. (The notation has the risk of 
confusing these points with singularities but, actually, only four of the 15 singularities are 
shown, at intersections labeled [u], [u + 2 ], [u — z], [u + 2z\). 

Also C[0] is the projection of some singular curve and the fact that the points 1, • • • ,6 lie 
on six tangents from C[0] causes the points to be special. (Only five constraints determine 
a conic). A further link in this arcane chain of reasoning is that the parameters of the 
hyperelliptic curve represent the six points of tangency. This explains the specialization 
of the hyperelliptic (5.2bc), (see [4] for explicit computation). 

13. Concluding remarks. There is al larger literature for real mutiplication by \/5 
because the conditions on the hyperelliptic curve are more interesting from the standpoint 
of group theory. Here there is a special “fifth-period” configuration which we do not pursue, 
restricting ourselves to the easier case of y/2. These are the only two cases which can be 
treated with any degree of satisfaction as of now (see [2], [9], [19]). 

The author gratefully acknowledges helpful conversations with Armand Brumer. 
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ARITHMETICITY AND THETA CORRESPONDENCE 
ON AN ORTHOGONAL GROUP 



Ze-Li Dou 

To Professor Kenneth B. Kramer 



Introduction 

In this paper we discuss several related topics in the theory of automorphic forms, 
on which the author’s current research is focused. Roughly speaking, we shall deal 
with the questions of arithmeticity regarding special values of certain zeta and L- 
functions, as well as algebraicity results concerning periods and Fourier coefficients 
of certain automorphic forms. 

The author has attempted to maintain a balance among several goals. He nat- 
urally wishes to bring the reader to his current research. On the other hand, 
he considers it an essential part of his task to explain how his work fits into the 
larger program concerning the same topics mentioned above, which was initiated 
by Shimura, and is being intensely pursued by Shimura himself and many others. 
From this point of view, it is not desirable to showcase only the results in one spe- 
cific setting. Finally, as this paper is based on a talk given at the Graduate Center 
of the City University of New York, the author also wishes to address as large an 
audience as is practical. This puts a further constraint on the amount of techni- 
cal material involved, at least initially. Consequently, he has adopted a “gradual 
generalization” approach — the article begins with a motivational section discussing 
the simplest possible setting only, and then each subsequent section introduces a 
generalization in at least one direction. Also, in each of the sections §§2-4, we 
concentrate on only one key concept. Thus in §2 we discuss the arithmeticity of 
automorphic forms; a nearly holomorphic arithmetic function (at critical points) is 
constructed in §3; and the main part of §4 is devoted to the discussion of a theta 
correspondence. These topics are all mutually related, and such relationships are 
pointed out in the discussions in the text. In general terms, we investigate, in 
the setting of a totally real quadratic extension of a totally real algebraic number 
field, a theta correspondence (see §4), and the algebraicity results on periods and 
L- values that follow (such as results of the type mentioned in §2), which can be 
derived with the aid of certain arithmeticity results of functions analogous to the 
f{w^ s) considered in §3. 

With this approach, the author hopes to have given an adequate explanation of 
the essential points of that part of the program in which his research is engaged. 
Admittedly, this is neither the most efficient nor the most “logical” approach — one 

Research for this article was partially supported by NSA Grant MDA904-97-1-0109 and by the 
Texas Christian University Research Foundation. 
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could have started from the setting of §4 and then proceeded backwards, thereby 
eliminating §1 altogether. Also, with this approach this paper cannot contain a 
full list of the author’s results with all their technical precision. However, he views 
his results as ones complementing those of Shimura, in a naturally generalized 
setting. Therefore, an understanding of Shimura’s methodology is crucial. For this 
reason the author considers the tradeoff more than reasonable, in the sense that 
the author’s results can to a certain extent be “inferred” from a understanding of 
Shimura’s theorems in this paper, but not the other way around. Finally, due to 
the limitation of space, no proof has been included in this paper. To compensate 
for these shortcomings, the author has inserted references throughout this paper, 
where precise statements of the latest results can be found, as well as their proofs. 

I would like to thank Toni Bluher for several conversations concerning both the 
content and style of this paper; I have found them extremely helpful. It is also 
my most pleasant duty to thank the editors. Professor Melvyn Nathanson and 
Professors David and Gregory Chudnovsky, for their invitation to give a talk at the 
Graduate Center, and for their subsequent invitation to include this paper in their 
volume. In addition. Professor Josef Dodziuk’s hospitality is responsible for some 
of the fondest memories of my visit to New York City in 1997. 

This paper is dedicated to Professor Ken Kramer of Queens College, as a token 
of my sincere gratitude to him. I am very fortunate to have been a recipient of his 
generous and constant support for well over a decade, starting from my years els an 
undergraduate student. 



1. Two examples of arithmeticity and near holoniorphy 



In this section we shall provide some motivation for the concept of arithmeticity 
by considering two examples in the case of the most classical modular forms. In so 
doing we follow the lead of Shimura. (See [Sh95].) We introduce some notation as 
follows: 

^ G C I Im(;^) >0}, F = Q, 

and 

f / ^ k\ I 

c = 0 (mod N) I 



ro(AT) = |a=(“ J)€SL2(Z) 



A meromorphic function / : H — > C is called an automorphic form of weight k for 
some A: G Z if it satisfies the following condition with respect to the usual action of 
elements a 6 Po(A’) on z e namely. 



where 



We shall write 



/(a^)=i(a, «)*/(«), yaeVo(N), 

def CLZ b 



cz d ' 



and 



j{a,z) cz +d. 



f\\ka(z) = j{a,z) ‘‘/(az). 
Then the above condition becomes simply 

(1) f\\ka = f, WaeToiN). 
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A holomorphic automorphic form / is one which is holomorphic on H and whose 
Fourier expansion at every cusp has the form where h is a pos- 

itive integer. That is, we require / to be holomorphic at every cusp. (Note that 
this requirement is not needed in the case of F ^ Q.) If in fact qq = 0, then 
we say / is a cusp form. The space of all such holomorphic automorphic forms 
is denoted by AdArCroCA’)); its subspace of cusp forms is denoted by Sk(To{N)). 
Both of these spaces are finite dimensional vector spaces, whose dimensions can be 
easily computed via the Riemann-Roch Theorem. To avoid having to refer to a 
specific congruence subgroup such as ro(A'), we take the union of all such spaces 
of holomorphic automorphic forms and denote it simply by A^fc. Similarly we have 
the definition of «Sjt. 

There are commutative, self-adjoint (with respect to the Petersson inner product) 
linear operators acting on the spaces of automorphic forms, which are called Hecke 
operators. As a consequence, there exist common eigenforms with respect to the 
Hecke operators. Also, we have the theory of newforms. These topics are very 
standard, and therefore we will not give any further explanation. A normalized 
Hecke eigenform will be referred to as a primitive form. 

Let (/) be a Dirichlet character defined modulo N. Then a character on ro(iV) 
is given by <^(a) 4>{da)y where da is the (2,2)-entry of a. We may consider the 

elements f of Mk satisfying the condition 

/|U-n = (^(a)/, ^aeToiNy 

Suppose / is a cusp form satisfying this condition. Then we can write 

oo 

n=l 

For every primitive Dirichlet character y, we define a Dirichlet series D(s,/, y) 
follows: 

oo 

D(s,f,x) = 

n=l 

Here s is a complex variable. This series is convergent on a half plane and can be 
analytically continued to tbe entire C-plane. 

We can now state the following theorem: 

Theorem. Suppose k > 1 and f is a primitive Hecke eigenform. There exist two 
periods and p- such that 

€ pJCx, 0 < m < k, 

where r(x) is the Gaussian sum of x, A\- is the field generated over Q by the x(^^) 
and the Fourier coefficients a„ of f , and £ = dol is determined by the relation 
x{-l)e = (-ir. 

For a proof, see for example [Sh76]. Note that there are infinitely many possible 
characters y and hence also infinitely many values D(m,/, y) as described above. 
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Thus the existence of the periods and p_ in the above theorem is remarkable and 
is a manifestation of the arithmeticity of the values of the zeta function £^(3,/, x)- 
Our next example is an Eisenstein series, which in the simplest case is given by 

Ei;(z,s)= ^ <;6(Q)j(a,2)“*(Iin(a«))’, zeH,seC. 

oe(Pnr)\T 

Here we have written ro(A^) as F for brevity, and the meaning of /c,a, </>, and 
(f>{a) are the same as above. The symbol P is defined by P = {a G SL 2 (Q) | 
Ca = 0}, where Cq is the (2, l)-entry of a. This Eisenstein series is convergent for 
k + Re(2s) > n 4 - 1- Also, after we multiply Ek{z,s) by a non-zero entire function, 
the resulting product can be continued to a real analytic function on P x C which 
is holomorphic in s. We are interested in the arithmetic nature of Ek{z^m) for 
certain critical integers. As an example, let us take m = 0,/c = 2, and the trivial 
character </>=!. Then we have 



'' ' n=l ^0<d|n ^ 

Notice that this is not holomorphic. For this reason w'is introduce the notion of 
near holomorphy.^ A function / : H — > C is said to be nearly holomorphic if it 
can be written as a finite sum of expressions of the form p(Im( 2 :)"^ ) 5 f( 2 ), where 
p is a polynomial and ^ is a holomorphic function. We may also speak of nearly 
holomorphic automorphic forms if we add in the automorphy condition (1).^ We 
then say that a nearly holomorphic automorphic form is arithmetic^ or Q-rational^ 
if the Fourier coefficients of such a form are algebraic. Theorems of Shimura, when 
specialized to our example at hand, then show that a modified version of £* 2 ( 2 ^? 0) is a 
nearly holomorphic and arithmetic automorphic form. In fact, Shimura showed that 
it is Qab-rational. For an illuminating discourse on the concepts of arithmeticity 
and near holomorphy, Shimura’s [Sh95] is strongly recommended. Later on in this 
paper, we shall present, in certain more general settings, the results mentioned in 
this section in precisely formulated forms. We shall also see the interconnection of 
such results. 

2. Arithmetic automorphic forms with respect to a quaternion algebra 

In this section we shall define arithmeticity more precisely. As always in this 
paper, we confine our discussion to the orthogonal case. Also, in accordance to 
the plan of “gradual generalization” we have adopted, we shall not operate in the 
most general setting possible. For the general development we refer the reader to 
Shimura’s papers [ShSO], [Sh86], and [Sh87]. (See also §3 of this paper.) In this 

^If we had chosen fc > 2 in this case, the situation would have been simpler. But the occurrence 
of near holomorphy is natural and not sporadic. See [Sh86]. 

^In the setting of our example, we also need to assume that / has a Fourier expansion of the 
form (/|Uct)(z) = exp(27rm2/A''or). This condition is not needed if we 

take a number field different from Q or if the automorphic forms are defined on multiple copies of 
H. 
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section, we shall consider the setting of a quaternion algebra over a totally real 
algebraic number field. The main reference for this section is [ShSlI] and [ShSlII]. 

Let, then, be a totally real algebraic number field with [F : Q] = n, and let 
-B be a quaternion algebra over F. Recall that this means ^ is a central simple 
algebra of dimension 4 over F. We have an isomorphism of the following type: 

(2) B(s>qR^M2(ry X 

where H denotes the ring of Hamilton quaternions. We arrange the archimedean 
primes u, in such a way that B is unramified at , V 2 ? • • • i ^^d ramified at 
Vr-\-i , r^r+ 2 ? • • • We assume that r > 1. Notice that this development allows the 
possibility B = M 2 (F); therefore it includes the Hilbert modular forms as a special 
case. 

Writing A for the adele ring of Q, we let 

Ba = B (8)q a and write Ga = (Ra)^- 

Here the symbol denotes the group of invertible elements of any ring R. Then 
we have Gq = B^ . The infinite part of Ga will be denoted by Goo, which is 
isomorphic to GL 2 (E)^ x We also write the identity component of Goo 

as Goo+- We then define 

G'a+ = {a: G Ga I Xoo € ^'cx>4-} and G(Q-|- = Gq fl Ga+* 

For every 0 < m G Z, there is an R-rational irreducible representation am • 
— y Gm+i(C) of degree m, which is unique up to equivalence. By fixing a 
suitable isomorphism (2), we may assume that the following properties are satisfied: 

€ M 2 (Q), VI < f < r,Va’ G B; 

am{x^,) G M;n+i(Q), Vrd-1 < z < n,Vx G B\ 

CTm(a:*) = ^m(ar), yx G H. 

Here * denotes the main involution of B as well as its natural extension to jBa- If /^ = 
(Kr-f 1 , «r+2, • • • , ^n) G Z”""' such that Kf > 0, then we may define a representation 
(Jk on Ga by 

n 

(^) (*■')• 
i=r+l 

The representation space is which will be denoted by A'. 

We can now define the action of elements of Ga+ on the factor of holomorphy 
j{x^ z) (where ^ G and congruence subgroups of G((^ as natural generalizations 

of the corresponding concepts defined in the previous section. This done, then given 
k £ and a vector valued mapping f : — > C^, where d = nr=r+i(^* 

define another mapping /||a;,k^ of the same type by the following formula: 

i+1 
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If r is a congruence subgroup, then we denote by A^jt,K(r), and <Sjt,/c(r) 

the space of meromorphic, holomorphic, and cusp automorphic forms, respectively, 
with respect to T. That is, we require = / lor all x 6 P. If 5 = M 2 (Q), 

then we also need to impose the usual condition at the cusps. As usual, the union 
of the spaces over all congruence subgroups T is denoted simply by Ak,K- 

The symbols Mk,K and Sk,n are defined in the same way. 

Furthermore, if is a character of F of finite order, then Af jt,K(F, </>) is the 
subspace of Mk,n such that f\\k,K^ = for all a: G F. Similarl}^ we have 

the definition of «Sa:,k(F, (;6). Notice that the distinction between Mk,K and Sk,n is 
unnecessary if .B is a division algebra, because in that case Mk,K = ^k,K’ 

Next we turn our attention to the meaning of arithmeticity for automorphic 
forms. Let K be a totally imaginary quadratic extension of F. Suppose h is 
an F-linear embedding of K into B. Then we have h{K^) C and h(K^) 

has a unique common fixed point on This point is called a CM-point. To 

define arithmeticity, we need to make use of a mapping Pk,K^ which takes values in 
GL(A'), and is constructed by means of a bilinear mapping related to the periods 
of abelian varieties. Since we do not wish in this article to stress the technical 
details, we refer the reader to Shimura’s [ShSO] and [ShSlI] for the definition of 
these mappings. An element / G Ak,n is said to be arithmetic at a CM-point w (at 
which / is holomorphic) if algebraic components. If this is so 

for all such CM-points, then / is called an arithmetic automorphic form. We shall 
use the symbol Q to indicate the set of arithmetic elements. Therefore we speak of 
‘5jt,K(Q),«5;t,K(F,Q),tSfc,K(F,(/>,(U), and so forth. 

We have the following identities: 

Mk,H = Mk^niQ) <S)qC, and Sk,H = <S^-,k(Q) <8>q€. 

For the remainder of this section, we shall describe an algebraicity result concern- 
ing a zeta function. In order to do so, we first need to consider adelic automorphic 
forms, which are defined on Ga- Given an integral ideal c of -F and a character 
0, we can define the space of automorphic forms and its subspace of cusp forms of 
weight (A:,«), level c, and character xp. These spaces will be denoted by Afjt,K(c,0) 
and Elements of such spaces will denoted by bold face letters. In the 

interest of economy of space, we shall not develop the definition of such forms from 
scratch. It turns out that every element f G -Ma:,k(c, 'ip) can be identified as a vector 
(/i, • • • ,A) € riA=i -^it,K(FA,0A) for suitably defined Fa and (?^a, A = 1, . . . , h. We 
refer to, say, [D93] for a full explanation. The meaning of arithmeticity for such 
adelic automorphic forms is a natural generalization of what we have just defined — 
if f G is identified with (/i,. .. ,//»), then f is called arithmetic if all /,• 

are arithmetic for i = 1, . . . , h. 

Assume that F has a subfield E such that [F? : Q] = r and that the restrictions 
of vi,. . . ,Ur to E are all different from one another. Our zeta function involves 
three main ingredients: a locally constant function rj on F, which is Q-valued, a 
Hilbert modular form H of Al/,o(M 2 (F)), and a primitive Hecke eigenform 0 ^ f G 
V^). The automorphic forms are taken among the arithmetic elements. This 
is possible because of the identities mentioned above concerning Sk,K ^^nd «5jt,K(Q), 
etc. Recall, also, that a function rj is locally constant if there exist two lattices L 
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and M in F such that r}{x) = 0 for x ^ L and 77(0:) depends only on x modulo M. 
We can find a subgroup U of 0 ^, where the symbol Of stands for the ring of integers 
in such that ri{ux) = t]{x) for all u E U. We then say that t] is U -invariant. 

Let us take such a locally constant function rj and a Hilbert modular form H as 
above, and write 

aeE 

where eE{ctz) exp(27T2 a^j zj). Here we have denoted the restrictions of v,- 

to E by the same symbols for notational simplicity. Let Ue denote the group of 
totally positive units in E. We then take C/ to be a subgroup of Ue of finite index 
such that both rj and Q are ^/-invariant, and introduce the notation [U] = [Ue • 

As for the primitive Hecke eigenform f, suppose we have f | Tc(a) = ^or 

all a, where the Ta(a) denote the Hecke operators. Then the set {x(^^)} is called a 
primitive system of eigenvalues. The zeta function Z{s) is defined by 

Z(S) = [U] v{b)x{br)u^uiTtF^EitbW-‘^. 

0<^b£F/U 

Here t £ F is such that > 0 for 1 < 7 < r and < 0 for r -f 1 < i < n; j: 
is a fractional ideal of F; c = where the c,- are integers or half-integers; 

= ( 6 ^^,... , 6 ^'')^; and 0Ju{ci) = a“u;(a), where u = uivi -f • • • H- UnVn with 
0 ^ Uj G Z/. 

We can now state a result of Shimura ([ShSlII]) concerning the algebraicity of 
certain critical values of the zeta function Z{s) as follows: 

Theorem. Suppose / -f 2u -f 2ReE/E{^) = 7(^1 + • • • 4- with 7 G Z, 2c,- = ki 
(mod 2) for i < r, K{ < 2c,- = k,- (mod 2) for i > r. Then for every integer 
satisfying 

(i) (^) - 1 + [F : £^] < So < (y) + Ci, 1 / 1 < i < r; and 

(ii) 2soj^j + 2[F :E], if E = Q, 

we have Z(sq) finite, and 

^N)~7rl'=l(g,g). 

X 

Here x ^ y means — G Q, and g is any arithmetic Hecke eigenform belonging to 

y 

the system of eigenvalues {x}* 

Moreover, if in addition we have 

2 c,- -j- A:,- -f* 2 = 7 4* 2[F : VI 2 ^ r, 

then the residue of Z at s = : F] is an algebraic number times g); 

where Re is the regulator ofE. 

The above theorem is certainly rather complicated technically; several more 
theorems of this nature can be stated in this setting, and can be found in [ShSlII], 
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together with detailed proofs. Moreover, theorems of this type have been obtained 
by Shimura in much more general settings, rendering the theorem stated above 
only a special case. For example, we may replace F by a direct sum of totally 
real algebraic number fields. The function /(t/;,s), which we define in the next 
section, will be given in this generality. In any case, for a fuller discussion we 
refer to [Sh83] and [Sh88]. In addition to the intrinsic interest of such results, 
which is evident, there are also important consequences concerning the L-functions 
attached to automorphic forms — indeed, they may be regarded eis special cases 
of the theorems of the above type. (The Theorem in §1 is a simple example of 
this.) Furthermore, these results are also intimately related to the several period 
invariants, also investigated by Shimura, Shimura’s article [ShSS] is one recent 
reference in which all these aspects are discussed in depth. Also in that paper, 
a list of far-reaching conjectures are stated on the precise nature of the periods 
of automorphic forms. Several of these conjectures have been settled by Shimura 
himself ([Sh90]), in the case where we have a division quaternion algebra. Parallel 
results for the non-division algebra case (i.e,, the case of Hilbert modular forms) 
have been obtained by the author; see [D94]. M. Harris and H. Yoshida also made 
penetrating contributions in this direction. See especially Yoshida’s paper [Y95]. 

We are interested in a further generalization of the type of results treated in 
this section. Namely, we wish to investigate zeta functions of the above type in the 
setting involving a field extension. In particular, we wish to examine the case where 
the field extension is a totally real quadratic extension of a totally real algebraic 
number field. A theta lifting for automorphic forms with respect to such a field 
extension will be constructed in the final section of this paper. 



3. The function /(i/;,s) in the orthogonal case 



As we have explained, the Theorem in the previous section can be considered 
as a generalization of the Theorem we cited in §1. In this section we shall con- 
struct a mapping /(u;,s), which can be regarded as a generalized version of the 
Eisenstein series we discussed in §1. Here the variable w belongs to a certain 
bounded symmetric domain and s is a complex variable. We shall see that f{w,s) 
is nearly holomorphic and arithmetic at certain critical points s. This does not 
merely round out the ideas about arithmeticity introduced in the first section, but, 
more importantly, the proof of the Theorem of §2 is aided by the consideration 
of such a function. (See [Sh81II] for the setting of §2 and [Sh88] for the general 
setting.) Moreover, we shall see that the mapping f{w,s) is defined via a theta 
correspondence, which leads naturally to the main topic of the next section. 

The main references for this section are Shimura’s [Sh86] and [Sh87], as well as 
the work of A. W. Bluher, [B90], [B94], and [B98]. We shall largely follow the 
notation in [Sh86] in this section. The setting of this section includes that of §2 as 
a special case, as we shall see. 

Let F denote a direct sum of totally real algebraic number fields Fi, . . . , Ft. Since 
many of the concepts in this setting are to be defined as natural generalizations of 
corresponding ones in §2, we shall denote them by the same symbols. For example, 
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the symbol gf{z) will now have the meaning 




The symbol J{Fi) here means the set of Archimedean primes of F,-. Such modi- 
fications in meaning will not be be explicitly mentioned, as so doing will increase 
the length of this article unnecessarily. Instead, we refer the reader to the papers 
referred to above for precise definitions, whenever there is a doubt. 

We assume that F is a common subfield of Fi, . . . ,Ft, and that Jf contains a 
subset £ such that the restrictions of places of e to F yields a bijection of £ onto 
J{E)^ with £ D J(F,) 7 ^ 0 for all 1 < i < t. Thus our F is analogous to the field in 
§2 denoted by the same symbol. We write £' — Jf — £• 

We now recall some general principles concerning the action of a certain orthog- 
onal group on a bounded symmetric domain. See [ShSO] and [B90] for details. 

Let n be a positive integer and V be a vector space over R of dimension n -f 2, 
and let 5 be a symmetric bilinear form on V (over R) of signature (n,2). Then 
S can be naturally extended to a C- valued symmetric form on V (8>r C. Let us 
consider the set 



M{S) = {veV0RC\ 5M = 0, S{v,v) < 0}, 

where 5'(v, u). Denote by B the set of ordered bases (a:, y) of two-dimensional 

totally negative subspaces of the quadratic space (V, 5) such that S[x] = 5[y] = —1. 
Then we have a bijection B i — > which is induced by (x^y) ^ x + iy. 

We define the block diagonal matrices 



Q = diag[/n, -h], 



R = diag 





where Ik denotes the identity matrix of order k. Then a matrix A G GLn+ 2 (Q (or, 
strictly speaking, a C-isomorphism V (8>r C — > ) can be found, such that the 

following identities hold: 



S{x^y) = ^{Ax)R(Ay)^ and S{x,y) — ^{Ax)Q{Ay), V.T,y € V <8>r C. 



Therefore the mapping v Av gives a C-linear isomorphism 



A : Af{S) — > AT(Q, R) {u e | hiRu = 0, ^uQu < 0}. 

The space Af{Q^R) has two connected components, which may be described as 
follows. Let 



Zn = {lo G I ^ww < 1 + ^\^ww\'^ < 2}. 



( 3 ) 
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We define two mappings p : t .Sf(Q, R) and p' : J^{Q, R) by 

/w \ { ^ \ 

P(w) = , and p'(iy) = « • 

V 1 / \—J 

Then p and p' naturally give rise to bijective mappings of C x onto the compo- 
nents of Af{Q, R) alluded to above. Indeed, we may simply define (c, w) cp{iu) 
and (c^iu) h-> cp'{w), respectively. Clearly the two components are homeomorphic 
to each other. We denote these components by Afo(Q^R) and Afo{Q^R)^ and the 
corresponding components of Af{S) by Afo{S) and A/J^(5). 

Summarizing, we have the following sequence of bijections: 



(4) 

Zn-^Aro{Q,R)/C^ ^Xo{S)/C^ 

{two-dimensional totally negative subspaces of (V, 5)}. 

We introduce here a majorizing form of S which will be needed in the next 
section. If tx; G then by (4) we see that w corresponds to a subspace W 
spanned by the real and imaginary parts of w' = A~^p{w) over E. Let be 
the orthogonal complement of W. Then S is negative definite on W and positive 
definite on W-^, Our (positive definite) majorizing form Pm is then defined by 
= —S on W and = S on W^. 

For notational clarity, we shall from now on write 

P[v\w] = Pw{v)’ 

We then have the following formula: 

(5) = -4S{w\^)-^\S{v,w')\‘^ > 0. 

Returning to the preparation for the definition of /(ie,s), let Vi be a vector 
space of dimension n,- -f 2 over F{ for each 1 < f ^ with iii > 0, and consider an 
P, -bilinear symmetric form 5, iVixVi — > Fi. We adopt the following notation: 

t 

F = n G(5i) = {a e SL(V;) I 5i(ax,ax) = 5i(x,x)}, 

i=l 

t 

S = {S^,... ,St}, G'(5) = nG(5.). 

1 = 1 

If r G Jf, then r G J{Fi) for some i, and we denote by VV the r-completion of 
Vi and by Sr the natural extension of Si to Vr <S>r C. We also write Ur = n,-. We 
then have the identification 



V (giQ C = Vr (8 >r C. 

t^Jf 
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In the following, we assume that Sr has signature (ur,2) on Vi- if t G e, and 
that Sr has signature {ur + 2,0) on Vr if r 6 e'. 

We can define Zr as in (3) for each r, and let 

Z = l[Zr. 

res 

The symbol S in the definition above is defined as follows. Arrange indices in such 
a manner that we have n, > 0 for 1 < i < r and n,- = 0 for r + 1 < f < t. We 
assume 0 < r < We then put 

J = £ n J{Fi X • • • Fr), and S' = e' H J{Fi x • • • x Fr). 

Given 0 < k G , we define Vk{S) to be the vector space of C- valued polynomial 
functions on rir€<5' which are homogeneous of degree Kr and 5r-harmonic on 
W- (See [ShSO] for the definition of 5r-harmonic functions.) We now take n( 2 ) = 
cj(a)ejc;(a 2 :) to be an element of Mi^o{E) with 0 < /. (Therefore is allowed 
to be a constant.) Let Ue and U be defined in the same way as in the previous 
section. Then we define 

f{w,s) = [U] c(i;)u;(Trir/£;(-5'H))5[i;]-^g(i;)u‘^ 

o^veviu 

Here c G C{V) is a locally constant function, and q G Vk[S). Also, 0 < A: G Z^,0 < 
K G Z*^ , 0 < G Z^, </) G Z^, and ^ is a subset of Jk such that 

Resv/./F,(0 G J(Vi)) = e n J(F,), Vz = r + 1,. . . ,t. 



def 

where K = Vr+i x • • • x Finally, ^ is a factor of automorphy defined by 

IJr[x,w] = Sr(xr,A~^p{Wr)) = \ArX"")Rp{lUr), X e Z. 

We have already explained that consideration of mappings of type /(i^,s) is 
indispensable for algebraicity results such as the Theorem in the previous section, 
which, in turn, yields further results in the study of arithmetic. Here we note that 
iht setting of the previous section corresponds to the case t = r = 1 here. 

We now consider the question of near holomorphy and arithmeticity. The mean- 
ing of near holomorphy and arithmeticity are natural generalizations of the defini- 
tions given in the previous sections. The precise definitions can be found in [Sh86]. 
(See also [B90] and [B9S].) Let us state a result of Shimura ([Sh86]) as follows. 

Theorem. The series f(wyS) can be continued as a meromorphic function to the 
whole s-plane; more precisely, there exist a non-zero entire function -4(5) and a real 
analytic function B(w,s) on Z x <C, which is holomorphic in s, such that 



A{s)f{w,s) — B{w,s) 
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for large Re(s). Furthermore, under certain conditions, f{w,s) is finite, nearly 
holomorphic, and arithmetic. This is more explicitly explained as follows. We 
assume that 

(i) hr > 0 for every r G S and (j)(r ^ 0 for a ^ U xpp; 

(ii) ResF!]^{k — k — 2j) — Resj^yE{4>) ~ ^ where a/ G 2~^Z; and 

(Hi) <t)a > ct)(^p, where p is the complex conjugation. 

Then we put 

t 

bf = af + 2- + 2)[Fi ■. E\!2. 

i-\ 

If bj > 1, then f{w, 0) is finite; in fact it is holomorphic in w with a fe w 
exceptions which can be explicitly identified. Furthermore, with some additional 
conditions, 7r^p/i'(— o) is arithmetic, where the power can he explicitly 
given, and pK is defined in [ShSO] as we alluded to in §J^. 

If hf = 0 or 1/2, then f{w,s) has at most a simple pole at s = l — hf. Let h be 
the residue there. Then, under some additional conditions, (/>, 2^)/i is 

arithmetic, where the power ^ can be explicitly computed, though not always equal 
to the power above. 

The conditions omitted above can be found in [Sh86]. The proof can be found 
in [Sh88]. Shimura’s techniques have been extended by A. W. Bluher to show that 
such results hold for a certain range of other integers and half integers as well. 
Moreover, by imposing somewhat more stringent conditions on the locally constant 
function c as well as the Hilbert modular form fl, Bluher showed that similar results 
hold for an even larger range of critical points for another series f(w,s), which is 
defined as a product of f{w,s) with a certain L-series. These results can be found 
in the papers [B90] and [B94]. 

Another feature of the theory we wish to mention here is the fact that f{w,s) 
has an integral representation of the type 

/(to, -5) = / C)y*E{z, s){dz), 

where E{z,s) is an Eisenstein series and Q{z,w\C) is a certain theta function. 
Therefore, we may say that f{w,s) is defined via a theta correspondence. In this 
regard, Bluher has proved that to a certain extent, arithmeticity is preserved by 
theta correspondence. (See [B98].) This result is relevant in the discussion of the 
arithmeticity of f{iu,s) above. Although we do not wish to discuss this particular 
result in detail, the idea of theta correspondence will be our focus in the next 
section. 

4. Theta correspondence with respect to a quadratic extension 

In this final section of the paper, we shall present a theta correspondence of 
automorphic forms with respect to a totally real quadratic extension of a totally 
real algebraic number field. Thus our setting is again a generalization of that of §2, 
but in a different direction from that of §3. In this setting, one may again hope to 
derive results analogous to those stated in §2, and to do so is the main focus of the 
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author’s current research. Rather than entering into a discussion on the generalities, 
we shall devote most of this section to a description of the correspondence in our 
specific setting, together with some of the properties. 

Let us explain our new setting. Happily, most of the ground work has already 
been laid in the last two sections, and consequently we can afford to be quite 
efficient. Let F be a totally real algebraic number field, and F/F a totally real 
quadratic extension.^ Let F£: be a quaternion algebra over E which is equipped 
with an F-linear automorphism r such that = id^^,, but t\e id£;. The main 
involution oi Be will be denoted by * . 

We shall need to consider the following subsets of Be'- 

B — {x ^ Be I = x} and V = {a* E | = — x*}. 

It can easily be shown that is a quaternion algebra over F, and V is a vector 
space over F of dimension 4 . 

Regarding the adele rings, we fix notation by setting 

Soo = M2(R)'’ X and {Be)oo = M2(R)< x . 

Let a be the set of Archimedean primes of F. For each G a, we fix, once and 
for all, an extension u G J{E). The collection of these primes is then written t. 
Further, we denote by 77 and 77', respectively, the subsets of t corresponding to 6 
and S'. To summarize our notation, we have the following identities: 

Si = SuS', and J(F) = C U C'i 
J{E) = tUri, and ^ = 77U77'; 

C = 77 U rr/, and = 77' U ri]'. 

The meaning of the notation rt, etc., is self-evident. Throughout this paper, we 
shall assume (as in an analogous situation in §2) that C 7^ 0- 

We now define holomorphic automorphic forms on . This time they are matrix 
valued mappings taking values in End(A), where A is defined in the same manner 
as the representation space in §2 denoted by the same symbol, and have weight 
k -f- rk, where k G ZL 

Given a mapping / : » End(A'), we define another mapping of the same 

kind, by the following formula: 

(fWk-\-Tk<^)(w) = j{a, 

We may then define Sk+rk{^) and Sk-\-rk{BE) by generalizing the definition in §2 
in a natural way. The detailed definitions can be found in [D 97 a]. 

To define our theta function, we take 2 N, where N is the norm, to be the 
symmetric F-bilinear form S (on "F) in § 3 . Thus we have 

S(x,y) = Tr(xy*), Vx,?/ G V. 

^Note that E is no longer a subfield of F! 
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In this case, the majorizing form of Sv for v E S can be explicitly computed. It 
turns out that we have 

= 2A^(0 + Im(wi)~^Im(w2)~'|K; wi,W2]P, 

'iveS,'^^ev^,Ww = {wi,w2)eH xH. 

We note that here the discussion in §2 is relevant, since the computations can be 
essentially reduced to that case. 

If C E J^iy) is a locally constant function and an element r £ F is chosen such 
that Ti, > 0 for v £ S and < 0 for v E then the theta function is defined to be 
an End( A') -valued function 0{z^w;C^r) on x as follows: 

( 6 ) 6 >(^,w;C,r) = 



Here 

D,, 1 = / «|f = II € 

" 1 + *Ini(z„)|[^, w]vp(Im(w^)Im(^Oru))~^ if u|f = 

In order to give the theta lift and its properties explicitly, which is crucial for 
algebraicity results, we must also derive explicit transformation formulas for the 
theta function. 

Let us first consider the action of on the variable w. If a E (Be)^ and 

N(a) E F, then 

0{z,w\C,r)Wk-\.rkOi = 0{z,iu;Ca,r), 
where Ca(0 C(a<^a~’’). Also, if p E F^ , then 
0{z^w;C,r) = 0{z,w] C' 
where C"(<J) C{p(). 

We now consider the action of elements of SL 2 (-P) on the variable 2 r. In this 
case we can specialize certain results of Shimura to our setting. See [Sh93] for a 
discussion in the general setting. Let SL 2 (i^) be the G, and the form 2r N be the 
5, in §3 of [Sh 93 ]. (Notice that, therefore, the usage of the symbol S differs in 
this paragraph from the rest of the paper.) Thus S is of signature ( 2 , 2 ) for v £ S 
and signature (0,4) for v E S'. Then the q loc. cit. equals 4. We then have 
F(det( 5 )s) = E and a straightforward calculation shows that 

J^(a,z) - |j(o,z)p'*j(a, 2 )*‘‘, 

where J^(a^z) is Shimura’s notation, except that here we have written - 2 : in place 
of Shimura’s 3 . We further note the following notational correspondences (in our 
specific setting), writing the notations of Shimura in [Sh93] on the right-hand side: 



= c{v) = s{v^y, 
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C G C{V) A G S{V^); -fC for 7 = a G Ga- 

In such a case we have the following results. Notations as above, every 7 G Ga 
gives rise to a C-linear automorphism of CiV)^ (this action depends on the choice 
of r!) which we denote by (7, C) »-> 7C, such that the following properties hold: 

(a) j(7, w; ■yC, r) = e{z, w; C, r), V7 € G\ 

(h) (y 5 )C = y{SC), V7,Jef?A. 

(c) For every C, there exists a congruence subgroup F of G, such that 

7C = G, V7 G r. 



(d) If (2) G Gh and C(2) = 0, then 

((2)C)(x) = |a(2)|iwh(a(2))eh(rN(x)a(2)6(2))C'(xa(2)), 

where uj denotes the Hecke character of F corresponding to E, and eh(y) eA(yh)* 
We also have, for p G E, p ^ 0, 

0(jpz^w\ G, r) = 0[z^w\p^'^'^C^pr). 

We now define the theta lifting as follows. Given h G Sk-\-Tk[BE)^ we define 
I{z\C,r\h) = {6{z,w\C,r),h{w)). 



Theorem. We have 

I{z-, C, r; h) = vol{D)-'^ J Tr (* 0 {z,w]C, r)/j(ty)) d^jW, 

where D = A\H^ and A a suitable congruence subgroup. I{z\ C^r] h) is a Hilbert 
modular form belonging to <S/t(SL2(E)), and its Fourier coefficients are related to 
the periods of h. More precisely, we have an expression of the form 

vol{D)I{z-, C, r; /i) = k ^ N(a)r‘'C'(Q)P(/j, a, A)cF(-r N(a)z), 

a 

where P(/i,a, A) is a certain period of h, which is defined by 

P(h,a,A)= [ [a, pw]^"^ Tr (^( 7 {a)h{pw)) d{Aap). 

‘/Aa\Ja ^ ^ 

Here a is a totally negative element of V , and w G H^ can be arbitrarily chosen. 
In fact, the above integral is independent of the choice of w, and vanishes for all a 
which are not totally negative. Finally the symbols Aq and Ja are suitably chosen 
subgroups o/SL2(R)‘*. 

For a proof of the above theorem, see [D 97 a]. This theorem generalizes a theorem 
of Shimura (see [Sh 82 ]). Its significance is partly due to the relation to the period 
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conjectures of Shimura; for a discussion of that connection we refer the reader to 
[Sh88]. On a more concrete level, we see that several of the key concepts discussed 
in §2 have already made an appearance in the Theorem above. In any case, this 
clearly provides a setting in which suitable generalizations of theorems of the type 
mentioned in §§2-3 can be sought. In fact, if we retrace our development through 
those two sections, we may to a certain extent anticipate such results. Therefore, the 
consideration of the setting of this section is natural. Also, in view of Yoshida’s work 
(see[Y95]), our setting is relevant. Without engaging ourselves in any speculations, 
we simply mention that the author has obtained several results in this direction, 
which can be found in the papers [D97a] and [D97b]. A third work is currently 
under preparation, which he hopes to complete in the near future. Since these 
results require a considerable amount of further technical preparation, their precise 
statements are omitted here. 
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MORPHIC HEIGHTS AND PERIODIC POINTS 



M. EINSIEDLER, G. EVEREST, AND T. WARD 
June 13, 2000 

Abstract. An approach to the calculation of local canonical mor- 
phic heights is described, motivated by the analogy between the 
classical height in Diophantine geometry and entropy in algebraic 
dynamics. We consider cases where the local morphic height is 
expressed as an integral average of the logarithmic distance to the 
closure of the periodic points of the underlying morphism. The 
results may be thought of as a kind of morphic Jensen formula. 



1. Introduction 

Let (j) : (Q) (Q) be a morphism of degree d, defined over the 

rationals. Call, Goldstine and Silverman (see [3], [4]) have associated to 
0 a canonical global morphic height on Q, with the properties that 

1. A^((/>(q')) = d\^{q) for any q € P^Q); 

2. q is pre-periodic if and only if = 0. 

A point q is called pre-periodic under 0 if the orbit {0^^^(g)} is finite 
(write for the nth iterate of a map /). The global height decom- 
poses into a sum of local canonical morphic heights X(p^y: 

=^nyX^^y{q). 

V 

Here v runs over all the valuations (both finite and infinite) of the num- 
ber field generated by q and the riy denote the usual normalising con- 
stants. In the special case that 0 takes the form 0[x, y] = [y^ f {x / y) , 
for a polynomial / of degree d. Call and Goldstine [3] prove that 

Vv(?) = „lim TA„(,^(")(qr)), (1) 

where is the local projective height Xv{q) = log"^ \q\v^ and that the 
local height A<^,^(g) vanishes if and only if \(p^'^\q)\v is bounded for all 
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n, and finally, that q is pre-periodic if and only if 

~ ^ (2) 



Example 1.1. 1. Let f{z) = with d > 1. Here the canonical 

morphic heights and the projective heights agree. Jensen’s for- 
mula ([13, Theorem 15.18]) gives 

/ log \y - q\dm{y) = log+ |g|, 

where m is the Haar measure on the circle Sh The circle is also 
the Julia set for this morphism on C, and it is the closure of the 
set of non-zero periodic points, which are all roots of unity. In 
the p-adic case, the Julia set is empty but it is still true that the 
local height is the Shnirehman integral of the logarithmic distance 
from the closure of the set of periodic points. For all finite and 
infinite, the following holds: 

log+ kl. = „lim 4 E log 1C - q\v 

For finite v this may be seen, for instance, using Section 3. Alter- 
natively, it follows from the Diophantine estimate 

k*^” - ?|i, > C{q)\n\y 



provided the left hand side is non- zero (cf. Remark 3.6). For r»|oo 
a result from transcendence theory (in this case. Baker’s theorem) 
is needed. 

Suppose a, 6 G Q with 4a^ -f 276^ ^ 0 and let 
... z^ — 2az'^ — Sbz + 

i n a z + i ■ 



Then / gives rise to a morphism of degree 4 which describes the 
duplication map on an elliptic curve. The global and local morphic 
heights coincide with the usual notions of height on the curve. 
For the infinite valuations, the local height is again the integral 
average of the logarithmic distance to points on the Julia set, 
which is the closure of the set of periodic points. At the singular 
reduction primes it is still true that the local height is the integral 
average of the logarithmic distance to the periodic points. Both 
of these assertions are proved in [6] where this morphism was used 
to construct a dynamical system which interprets these heights in 
dynamical terms. The proofs require elliptic transcendence theory 
to show that a rational point cannot approximate a periodic point 
too closely (cf. Proposition 3.8). This is part of a much broader 
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analogy between heights in Diophantine geometry and entropy in 
algebraic dynamics (see [5], [6], [7], [10]). 

In this paper, our purpose is to describe a family of examples where 
the local canonical morphic height can be expressed as a limiting inte- 
gral over periodic points of the underlying morphism. The finite and 
infinite cases require different approaches. In both, we consider the spe- 
cial class of morphisms corresponding to affine polynomial maps and 
in the former case, we assume good reduction in the sense of Morton 
and Silverman [12]. 

There are two directions in which this work can be made more so- 
phisticated that are not pursued here. The first is to give a more formal 
interpretation of the limiting process using Shnirel’man integrals (see 
[14], or [8] for a modern treatment); the second is to extend the argu- 
ments to other morphisms. 

2. Complex case 

Assume that (f) : P^(C) — > P^(C) is a morphism of degree d, with 
(!)[x^y\ = [y^ f {x / y) ^ y^] for a polynomial / of degree d. For basic def- 
initions of complex dynamics, consult [1]. The following theorem ex- 
presses the local morphic height as an integral over the Julia set J(/) 
of the polynomial /, and standard results from complex dynamics show 
that this is in turn a limiting integral over periodic points. 

Theorem 2.1. If f{z) — az^ ao is a polynomial, then for any 

g G C, 

A<^,oo(g) = log l“l + log |a; - q\dm{x), (3) 

where m is the maximal invariant measure for f on J{f). 

Proof. Assume first that q is in the domain of attraction of oo for 
/. The zeros of the polynomial fn{^) — ~ ^ precisely the 

solutions of the equation f^'^\x) = x. Note that dn = deg(fn) = dP, 
where d = deg(/). Since \ f^'^\q)\ — > oo, ^log|/n(g)| is approximately 
^ log [/^"^^(g)!, which converges to A^,oo(^)- Since q lies in the open 
Fatou set, log \x — q\ is continuous on J(/). Now 

4log|/n(9)| = 4 J2 log|x-Q| + ^log|S„|, (4) 

where the sum is over the nth ‘division points’ and 

R = a^+d+d^+-+d(r^-^^ 
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is the leading coefficient of Thus 

^log|B„l = i(~)logH-^logH. (5) 

Now it is known that 

h T log|a; - g|dm(a;), 

/(n)(x)=x 

where m is the maximal invariant measure for / restricted to the Julia 
set (see [11]; [9]). 

In the following it is convenient to assume that a=l, we can ensure 
this by conjugating by a linear map. Now \x — f{q)\ = ri/(t)=x 1^ ~ 
so 




(The last equality follows from [11] or [9, Theorem (d)]). 

Let now q ^ J(/) have bounded orbit. Since J(/) is closed, we can 
find e > 0 such that B^{q) fl J(/) = 0. If \ f^{q) — ^| > e/2 for almost 
all n, then 

-/log|/”(g) - 9l 0. 

We can argue as in the first case to get 

/ log|x - g| = 0 = A^,oo(g)- 

Assume now \f^^{q) — q\ < e/2 for some sequence rij — ^ cx). Then 
\f^^{q) — x\ > e/2 for x 6 J(/). However, since J(/) and /"^^(g) are 
bounded, we have also an upper bound 

log ^ < / log |x - /"^ (q) I < M. 

^ Jj(f) 

Together with Equation (6) we get 

l/(P= log I < / log \x-q\< l/(TiM 
^ 7j(/) 

which concludes the proof in this case. 

It remains to show that the formula holds for q E t/(/). Since J(/) 
has no interior, there is a sequence qn q with q^ ^ J{f)- Then 
log \x — qn\ log \x — q\ for all x E J{f)\{q}- Since J(/) is bounded. 
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log \x — Qn\ and log \x — q\ are uniformly bounded above by 
X G So by Fatou’s lemma 



0 



lim / log \x — qn\dm(x) < / log \x — q\dm(x 

n^oojj^f^ 7j(/) 



M say for 
< M. 

( 7 ) 



This shows that x log |x — ^| is in L^{m). If 

X — q\dm{x) > 0, 

then Equation (6) contradicts (7). □ 




3. The p-ADic case 

Let Cp denote the usual completion of the algebraic closure of the 
p-adic numbers Qp , and use | • | to denote the extension of the p- 
adic norm to Cp. Write Op for the ring of integral elements in Cp, 
and V for the maximal ideal of Op. In this section we assume that 
(j) : P^(Cp) P^(Cp) is a morphism of degree d corresponding to an 
affine polynomial / of degree d with coefficients in Op and leading 
coefficient in O*. Notice that these assumptions are, for polynomials, 
equivalent to the assumption that the map (f) has good reduction in the 
sense of [12]: (f) induces a morphism of schemes over Spec((9p). The 
Julia set is empty in this setting (see [2], [12]), so a direct analogue of 
(3) is not possible. 

The main result expresses the local canonical morphic height as a 
limiting integral over periodic points for the polynomial /. 

Theorem 3.1. 7/0 has good reduction and is defined by a polynomial 
f of degree d, then 

A</.,p(g) = log+ |g| = Jim ^ y; log le - q\ 

where the sum is taken with multiplicities. 

Notice first that for q G Cp\Op this is clear, so from now on we 
assume that q ^ Op. Despite the simple resulting value of the height, 
the convergence involved requires an argument. The main issue is to 
produce lower bounds on the size of |^ — Cl fc>r distinct periodic points 
C and C- 

The least period of a periodic point f is the cardinality of the orbit of 
f. The points of period n are the solutions to the polynomial equation 

- X = 0, (8) 

and are therefore all elements of Op. Following [12], for a periodic 
point C define an(C) fo be the multiplicity of C in (8), with the obvious 
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convention that ^ has multiplicity zero in an equation that it does not 
satisfy. Notice that an{^) ^ 0 if and only if n is a multiple of the least 
period of Define a* (^) by 

d\n 

where /i is the Mobius function. Increases in the multiplicity of the 
periodic point ^ along the sequence of multiples of its least period are 
recorded by a* ((^). The periodic point is an essential n-periodic point 

if<(O>0. 

The following proposition is a special case of [12, Prop. 3.2]. 

Proposition 3.2. Let K he an algebraically closed field of characteris- 
tic p > 0, and f a polynomial over K with degree d> 2. Fix a periodic 
point Q ^ K with least period m, and let r denote the multiplicative 
order of (f^'^^YiC) in K* or oo if is not a root of unity. Then 

for n >1, > 1 if and only if one of the following conditions hold. 

1. n = m; 

2. n = mr; 

3. p > 0 and n = p^mr for some e > 1. 

When K = Cp, this proposition will also be applied to the poly- 
nomial / induced by reduction mod V. Notice that the sum of the 
multiplicities of the points of period n under / lying in one residue 
class gives the multiplicity of the image point as a point of period n 
under /. 

For the proof of Theorem 3.1 the following proposition is needed; 
this will be proved later. 

Proposition 3.3. Suppose 'Is a periodic point with least period n 
for a polynomial f of good reduction. Then for any fixed q E Op, 
k “ Cn| 1 as n oo. 

Proof, (of Theorem 3.1) By Proposition 3.2 applied to the field Cp with 
characteristic zero, if ^ is a periodic point with least period m, then the 
multiplicity of ^ viewed as a periodic point of period m, 2m, 3m, ... is 
uniformly bounded. Fix q e Op and s G (0, 1). Proposition 3.3 says 
that the number of periodic points in the metric ball Ds{q) is finite. It 
follows that 

^ ^ 1^ - > log s- 

On the other hand, each term in the sum is non-positive, so letting 
s — > 1 proves the theorem. □ 
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All that remains is to prove Proposition 3.3, for which we need some 
lemmas. 



Lemma 3-4. Assume that /(O) = 0, and let Q be a periodic point of 
f. Then |C| = for all n > 1. 

Proof. The spherical metric used in [12] coincides with the usual metric 
in Op, and / has good reduction. So by [12, Prop. 5.2], 

l/(a^) - f{y)\ <\x-y\ 

for x,y E Op. The lemma follows at once. □ 

Lemma 3.5. Assume that /(O) =0 and n> 1 is fixed. 

1. //|/'(0)|<1 </ien = 1 - 



Proof. In case 1., f'{0) == 0 in the algebraically closed field OpjV, so 
Proposition 3.2 may be applied with C, = Q -i-V , m — 1 and r = oo. It 
follows that for n > 1 a* (0 -f P) = 0, so there cannot be an essential 
n-periodic point ^ for / with |^| < 1. 

In case 2., m — 1 and r = 1 for the point ^ = 0 + ^ in Proposition 
3.2. It follows that only values of n of the form p^ are relevant. Notice 
that 



n 

Ct^o 






( 9 ) 



If /'(O) 7 ^ 1, then the right-hand side of (9) is given by 



(/'(O)F'^ - 1 
(/'(o)K~^ - 1 



1 

p 



by the binomial theorem. If /'(O) = P write f{x) = a; -f x^g{x) with 
e > 1 and ^(0) ^ 0, then a simple induction argument shows that 



f^^\x) — X kx^g{x) + ^ 

It follows that (9) is equal to p~^ again. 



□ 



Proof, (of Proposition 3.3) Let ( be any periodic point, with least pe- 
riod £. The first step is to prove the proposition for g = C- Let ^ 
have least period n under /. The multiplicity of C + P, which has 
least period m for some m\£ must increase at £ (because a^(C) > I)- If 
follows by Proposition 3.2 that £ is equal to m, mr, or mrp^ for some 
e > 1. Assume first that n is not of one of those forms; then |<f — Cl = I 
because the multiplicity of C + 'P cannot increase at n in OpjV. 
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In the remaining cases, we may assume for large n that ^\n. Then 
^ is a periodic point with least period n/(. under Applying the 
conjugation x ^ x — C means that 0 is a fixed point of g, where 

g(a:) = /W(x + 0 - C- 

If |^'(0)| < 1, then by Lemma 3.5 applied to g, |C ~ Cl = 1- 
If |^'(0)| = 1, let t be the order of ^'(0) + 7^ in Op/V. Then 

There exists a c > 1 such that 

|(g'(0)r - 1| < l/P- 

As before, we may assume that tp^i\n^ so Lemma 3.5 may be applied 
to the map h = and the periodic point ^ — C of least period 
to give 

It follows by Lemma 3.4 that |C “ Cl ^ rp-tv'^tln^ Since depend 
only on<^, |C“CI 1 oo. The ultrametric inequality in Cp 

now gives the result for any q E Op. □ 

Remark 3.6. Notice that the discussion above also gives a quanti- 
tative version of Proposition 3.3. This Diophantine result may be of 
independent interest. If / is a polynomial of good reduction, then 

\f^^\q)-q\>C{f,q)\n\ 

for all n > 1, provided the left hand side is non- zero. 

Example 3.7. To see the different cases that are possible in Proposi- 
tion 3.3, consider the following examples. 

1. Let f{x) = g{x^) -\-ph{x) be a monic polynomial with coefficients 
in Op. Then |/'(^)| < 1 for any q G Op, so in Lemma 3.5 only the 
first case is ever used. It follows that in Proposition 3.3, |C~Cl — 1 
for any distinct periodic points 

2. Let f{x) = x'^ — (1 + a)x for some small a. Then 0 is a fixed point, 
and |/'(0)| = 1. Now 

f^^\x) — X = x'^ — (2a -h 2)x^ -h {o? -f a)x^ -f (a^ -f 2a)x, 

so {f^‘^\x)—x)/{f{x)—x) has constant term (a^ + 2a)/(— a — 2) — 
—a. Therefore there must be two non-zero points of period 2 that 
are close to the fixed point 0. 

If the polynomial / has coefficients outside Op, then in contrast to 
Proposition 3.3, there may be sequences of periodic points converg- 
ing to a periodic point. For example, f{x) = on C 2 has this 
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property. Therefore, to recover Theorem 3.1 in greater generality (for 
polynomials of bad reduction or rational functions) some kind of Dio- 
phantine approximation results are needed. In Example 1.1.2 these 
tools are provided by elliptic transcendence theory. 

Proposition 3.8. Let C be a periodic point with least period £ under 
f. Assume that |(/^^^^(C)I > 1- Then there are periodic points C 7^ C 
arbitrarily close to 

Proof. Define g = and a = Without loss of generality 

we can assume that C == 0- Then 

g{x) = ax -\- bx^ + 0{x^'^^) with b 0 

and 

g^‘^\x) — a^x -h {ab + a^b)x^ + 

Define 62 = {cib -|- a® 6 ), then I 62 I = |^||<^|^- By induction one can see 
that 

= af^x + bkX^ + 0(x®+i) 

with |ifc| = | 6 ||a|*‘'. 

Therefore the Newton polygon of g^^\x) — x starts with a line with 
slope s < — /c + c for a fixed c (depending on b and e). So there exists 
a periodic point ^ with |^| = p^ < p~^'^^. □ 

4. Tchebycheff polynomials 

Example 4.1. Consider the Tchebycheff polynomial of degree d, f{z) = 
Td{z) — cos(darccos( 2 :)). The Julia set is the interval J{f) — [—1, !]• 
The map 0 : C — > C given by (j){z) — ^{z z~^) is a semi-conjugacy 

from g \ z ^ z^ onto z 1 — » /( 2 :), in other words, f{(j){z)) = </>(^^). Write 
^ for the branch of the inverse of (j) defined on{ 2 :EC||z|>l}. The 
canonical morphic height at the infinite place is (for q ^ J(f)) 

A^, 00 (g) = J-irn J^log+|/(")( 9 )| 

= Ji™ J;;log+|(^£?(”V(g)l 

= lim J-log'*' |1 (g^^^'tl’iQ) + / f , X ) I 

n^oodp- * '2 \ ' g^”V(g)/ 

= { 0 . Jr log |g^"V(g)l} 

= log+|V^(g)|. 

For q G J(/), the same formula holds since X(p^oo{(l) — 0 there by [3] 
and log”^ |'0(^)| = 0 there by a direct calculation. 
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Now by Jensen’s formula, for any g € C, 

log+|^(g)| = log2+ f \(j){y)-q\dy 

= log2+ / log |f - g|dm(f) 

J J{f) 

since m is the image under (p of the maximal measure (Lebesgue) on 
the circle. That is, 

A,/,, 00 (g) = log 2 + f log \t - q\dm{t). (10) 

The constant log 2 in Xoo{q) may be explained in accordance with 
Theorem 2.1. The leading coefficient of Td is 2^“^, so ^ log |a| in this 
case is exactly log 2. 

A similar approach can be adopted in the case of polynomials with 
connected Julia sets. There the local conjugacy near oo extends to the 
whole domain of attraction of oo, which is the complement of the filled 
Julia set. 

Example 4.2. As before, let f{x) = Td(x) = cos(darccos(x)) be the 
Tchebycheff polynomial of degree d and let (p be the corresponding 
morphism. We would like to use Theorem 3.1, but / does not satisfy 
the assumptions since it is not monic. 

Let g{x) = 2/(|). Notice that / is defined uniquely by the property 
f (^{z z~^)) = z~"^). It follows that g is characterized by the 

property g{z-Vz~^) = (z^-f-z“^), which shows that g G Z[x] is a monic 
polynomial. Let ip be the morphism defined by g^ then by Theorem 3.1 
we have for q E Cp 

Av,,p(g) = log+ |g| = ^ ^ log|^-g|. (11) 

Since g{x) = 2/(|), we have that == \^^p(2q) and on the right 

hand side of (11) that ^ if if ~ Therefore 

A<^,p(g) = log+ I2gl = log|2| 4- Y, log|^-g|> 

which is again analogous to Equation (10) in Example 4.1. 

Example 4.2 works because the Tchebycheff polynomial can be con- 
jugated to a polynomial of good reduction; a similar approach can be 
adopted for any polynomial that is conjugate to one of good reduction. 
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THE ELEMENTARY PROOF OF THE PRIME NUMBER THEOREM: 
AN HISTORICAL PERSPECTIVE 

(by D. Goldfeld) 

The study of the distribution of prime numbers has fascinated mathematicians since 
antiquity. It is only in modern times, however, that a precise asymptotic law for the number 
of primes in arbitrarily long intervals has been obtained. For a real number x > 1, let 7t(x) 
denote the number of primes less than x. The prime number theorem is the assertion that 

lim 7t(x) / ^ = 1. 

X-^CX) / log(.Tj 

This theorem was conjectured independently by Legendre and Gauss. 

The approximation 

^ A\og{x) + B 

was formulated by Legendre in 1798 [Lei] and made more precise in [Le2] where he 
provided the values A = 1, B = —1.08366. On August 4, 1823 (see [Lai], page 6) Abel, 
in a letter to Holmboe, characterizes the prime number theorem (referring to Legendre) as 
perhaps the most remarkable theorem in all mathematics. 

Gauss, in his well known letter to the astronomer Encke, (see [Lai], page 37) written 
on Christmas eve 1849 remarks that his attention to the problem of finding an asymptotic 
formula for ty{x) dates back to 1792 or 1793 (when he was fifteen or sixteen), and at 
that time noticed that the density of primes in a chiliad (i.e. [x,x + 1000]) decreased 
approximately as l/log(x) leading to the approximation 

The remarkable part is the continuation of this letter, in which he said (referring to 
Legendre’s approximation and Legendre’s value A{x) = 1.08366) that whether 

the quantity A{x) tends to 1 or to a limit close to 1, he does not dare conjecture. 

The first paper in which something was proved at all regarding the asymptotic dis- 
tribution of primes was Tchebychef’s first memoir ([Tchl]) which was read before the 
Imperial Academy of St. Petersburg in 1848. In that paper Tchebychef proved that if any 
approximation to tt(x) held to order x/\og{x)^ (with some fixed large positive integer TV) 
then that approximation had to be Li(x). It followed from this that Legendre’s conjecture 
that lim A{x) = 1.08366 was false, and that if the limit existed it had to be 1. 

x—^oo 

The first person to show that tt{x) has the order of magnitude was Tchebychef 

in 1852 [Tch2]. His argument was entirely elementary and made use of properties of 
factorials. It is easy to see that the highest power of a prime p which divides x! (we 
assume x is an integer) is simply 



X 


+ 


X 


+ 


X 


-f 


.p\ 




p2 




p3 





179 



D. Chudnovsky et al. (eds.), Number Theory 
© Springer- Verlag New York, Inc. 2004 




180 



where [t] denotes the greatest integer less than or equal to /;. It immediately follows that 

x\ = p[•'^/p] + [^/p“] + ••• 

p<.T 



and 



log(.T!) 




Now log(.7:!) is asymptotic to x’log(x) by Stirling’s asymptotic formula, and, since squares, 
cubes, ... of primes are ccmiparatively rare, and [x/p] is almost the same as .x/p, one may 
easily infer that 



xE— =^log(,x) H 

p<x ^ 

from which one can deduce that 7t{x) is of order 
Tchebychef, who actually proved that [Tch2] 

for all sufficiently large numbers x, where 



0(x) 

This was essentially the method of 
5 



log 2 log 3 log 5 log 30 

2 ~5 W~ 



0.92129 



and 

6B 

1.10555. 

5 

Unfortunately, however, he was unable to prove the prime number theorem itself this way, 
and the question remained as to whether an elementary proof of the prime number theorem 
could be found. 

Over the years there were various improvements on Tcheby chef’s bound, and in 1892 
Sylvester [Syll], [Syl2] was able to show that 



0.956 < 7t(x) / - — — — < 1.045 

/ log(a^) 

for all sufficiently large x. We quote from Harold Diamond’s excellent survey article [D]: 

The approach of Sylvester was ad hoc and computationally complex; it offered 
no hope of leading to a proof of the P.N.T. Indeed, Sylvester concluded in his 
article with the lament that “...we shall probably have to wait [for a proof of 
the P.N.T. ] until someone is born into the world so far surpassing Tchebychef 
in insight and penetration as Tchebychef has proved himself superior in these 
qualities to the ordinary run of mankind.” 
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The first proof of the prime number theorem was given by Hadamard [HI], [H2] 
and de la Vallee Poussin [VP] in 1896. The proof was not elementary and made use of 
Hadainard’s theory of integral functions applied to the Riemann zeta function (^{s) which 
is defined by the absolutely convergent series 



c(«) = 

n = l 



for Re (s) > 1. A second component of the proof was a simple trigonometric identity 
(actually, Hadamard used the doubling formula for the cosine function) applied in an 
extremely clever manner to show that the zeta function didn’t vanish on the line Re{s) = 1. 
Later, several simplified proofs were given, in particular by Landau [L] and Wiener [Wl], 
[W2], which avoided the Hadamard theory. 

In 1921 Hardy (see [B]) delivered a lecture to the Mathematical Society of Copen- 
hagen. He asked: 

“No elementary proof of the prime number theorem is known, and one may 
ask whether it is reasonable to expect one. Now we know that the theorem 
is roughly equivalent to a theorem about an analytic function, the theorem 
that Riemann’s zeta function has no roots on a certain line. A proof of such 
a theorem, not fundamentally dependent on the theory of functions, seems to 
me extraordinarily unlikely. It is rash to assert that a mathematical theorem 
cannot be proved in a particular way; but one thing seems quite clear. We have 
certain views about the logic of the theory; we think that some theorems, as we 
say ‘lie deep’ and others nearer to the surface. If anyone produces an elementary 
proof of the prime number theorem, he will show that these views are wrong, 
that the subject does not hang together in the way we have supposed, and that 
it is time for the books to be cast aside and for the theory to be rewritten.” 

In the year 1948 the mathematical world was stunned when Paul Erdos announced 
that he and Atle Selberg had found a truly elementary proof of the prime number theorem 
which used oidy the simplest properties of the logarithm function. Unfortunately, this 
announcement and subsequent events led to a bitter dispute between these two mathe- 
maticians. The actual details of what transpired in 1948 have become distorted over time. 
A short paper, “The elementary proof of the prime number theorem,” by E.G. Straus has 
been circulating for many years and has been the basis for numerous assertions over what 
actually happened. In 1987 I wrote a letter to the editors of the Atlantic Monthly (which 
was published) in response to an article about Erdos [Ho] which discussed the history of 
the elementary proof of the prime number theorem. At that time Selberg sent me his file 
of documents and letters (this is now part of [G]). Having been a close and personal friend 
of Erdos and also Selberg, having heard both sides of the story, and finally having a large 
collection of letters and documents in hand, I felt the time had come to simply present the 
facts of the matter with supporting documentation. 

Let me begin by noting that in 1949, with regard to Paul Erdos’s paper, “On a 
new method in elementary number theory which leads to an elementary proof of the prime 
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number theorem,” the Bulletin of the American Mathematical Society informed Erdos that 
the referee does not recommend the paper for publication. Erdos immediately withdrew 
the paper and had it published in the Proceedings of the National Academy of Sciences 
[E] . At the same time Atle Selberg published his paper, “An elementary proof of the 
prime-number theorem,” in the Annals of Mathematics [S]. These papers were brilliantly 
reviewed by A.E. Ingham [I]. 

The elementary proof of the prime number theorem was quite a sensation at the 
time. For his work on the elementary proof of the PNT, the zeros of the Riemann zeta 
function (showing that a positive proportion lie on the line |), and the development of the 
Selberg sieve method, Selberg received the Fields Medal [B] in 1950. Erdos received the 
Cole Prize in 1952 [C]. The Selberg sieve method, a cornerstone in elementary number 
theory, is the basis for Chen’s [Ch] spectacular proof that every positive even integer 
is the sum of a prime and a number having at most two prime factors. Selberg is now 
recognized as one of the leading mathematicians of this century for his introduction of 
spectral theory into number theory culminating in his discovery of the trace formula [A- 
B-G] which classifies all arithmetic zeta functions. Erdos has also left an indelible mark on 
mathematics. His work provided the foundations for graph and hypergraph theory [C— G] 
and the probabilistic method [A-S] with applications in combinatorics and elementar}^ 
number theory. At his death in 1996 he had more than 1500 published papers with many 
coauthored papers yet to appear. It is clear that he has founded a unique school of 
mathematical research, international in scope, and highly visible to the world at large. 

Acknowledgment: The author would like to thank Enrico Bombieri, Melvyn Nathanson, 
and Atle Selberg for many clarifying discussions on historical detail. In addition I received 
a wide variety of helpful comments from Michael Anshel, Harold Diamond, Ron Graham, 
Dennis Hejhal, Jeff Lagarias, Attila Mate, Janos Pach, and Carl Pomerance. 

March 1948: Let 'd(x) = log(p) denote the sum over primes p < x. The prime number 

p<x 

theorem is equivalent to the assertion that 

lim = 1. 

x-^oo X 

In March 1948 Selberg proved the asymptotic formula 

'd{x)\og{x) + ^log{p)T9 = 2xTog(x) + 0{x). 

He called this the fundamental formula. 

We quote from Erdos’s paper, “On a new method in elementary number theory which 
leads to an elementary proof of the prime number theorem,” Proc. Nat. Acad. Scis. 1949: 

“Selberg proved some months ago the above asymptotic formula, ... the 
ingenious proof is completely elementary ... Thus it can be used as a 
starting point for elementary proofs of various theorems which previously 
seemed inaccessible by elementary methods.” 
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Quote from Selberg: Letter to H. Weyl Sept. 16, 1948 

“I found the fundamental formula ... in March this year ... I had found a 
more complicated formula with similar properties still earlier.” 

April 1948: Recall that 'd{x) = log(p)- Define 

p<x 

a = hmmf A = hmsup . 

X X 

Sylvester’s e^stimates guarantee that 

0.956 <a<A< 1.045. 

In his letter to H. Weyl, Sept. 16, 1948, Selberg writes: 

“I got rather early the result that a A = 2,” 

The proof that a + A = 2 is given as follows. Choose x large so that 

p(x’) = ax o{x). 

Then since 'd(.x) < Ax T o{x) it follows from Selberg’s fundamental formula that 
ax log(a;) + ^ A- log(p) > 2x log(x) + o{x log(.T)). 

p<x ^ 

Using Tcheby chef’s result that 

P 

p<x 

it is immediate that a -f A > 2. On the other hand, we can choose x large so that 

'd(x) = Ax + o{x). 

Then since i9{x) > ax o(x) it immediately follows as before that 

A.t log(x) + a- log(p) < 2x log(x) + o(x log(x)), 

z — / p 

p<x ^ 

from which we get a 4- A < 2. Thus 

<2 T A = 2. 

Remark: Selberg was aware of the fact (already in April 1948) that a + A = 2, and that 
the prime number theorem would immediately follow if one could prove either a = 1 or 
A=l. 
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May-July 1948: We again quote from Selberg’s letter to H. Weyl of Sept. 16, 1948. 

“In May I wrote down a sketch to the paper on Dirichlet’s theorem, during 
June I did nothing except preparations to the trip to Canada. Then around 
the beginning of July, Turan asked me if I could give him my notes on 
the Dirichlet theorem so he could see it, he was going away soon, and 
probably would have left when I returned from Canada. I not only agreed 
to do this, but as I felt very much attached to Turan I spent some days 
going through the proof with him. In this connection I mentioned the 
fundamental formula to him, . . . However, I did not tell him the proof 
of the formula, nor about the consequences it might have and niy ideas in 
this connection... I then left for Canada and returned after 9 days just as 
Turan was leaving. It turned out that Turan had given a seminar on my 
proof of the Dirichlet theorem where Erdos, Chowla, and Straus had been 
present, I had of course no objection to this, since it concerned something 
that was already finished from my side, though it was not published. In 
connection with this Turan had also mentioned, at least to Erdos, the 
fundamental formula^ this I don’t object to either, since I had not asked 
him not to tell this further.” 

July 1948; Quote from E.G. Straus’ paper, “The elementary proof of the prime number 
theorem.” 

“Turan who was eager to catch up with the mathematical developments 
that had happened during the war, talked with Selberg about his sieve 
method and now famous inequality (Fundamental Formula). He tried to 
talk Selberg into giving a seminar ... Selberg suggested Turan give the 
seminar. 

This Turan did for a small group of us, including Chowla, Erdos and 
myself, ... After the lecture ... there followed a brief discussion of the 
unexpected power of Selberg’s inequality.” 

“Erdos said, 

/ think you can also derive 



lim ^ = 1 

71 — 00 



from this inequality. 

In any case within an hour or two Erdos had discovered an ingenious 
derivation from Selberg’s inequality. After presenting an outline of the 
proof to the Turan Seminar group, Erdos met Selberg in the hall and told 
him he could derive > 1 from Selberg’s inequality.” 

“Selberg responded something like this: 
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You must have made a mistake beeause with this result I can get an ele- 
mentary proof of the prime number theorem, and I have convinced myself 
that my inequality is not powerful enough for that. 

Quote from Weyl’s letter to Selberg August 31, 1948 

“Is it not true that you were in possession of what Erdos calls the fun- 
damental inequality and of the equation a -f A = 2 for several months 
but could not prove a = A 1 until Erdos deduced — > 1 from your 
inequality?” 

Here is Selberg’s response in his letter to Weyl, Sept. 16, 1948. 

“l\iran had mentioned to Erdos after my return from Montreal he told me 
he was trying to prove > 1 from my formula. 

Actually, I didn’t like that somebody else started working on my unpub- 
lished results before I considered myself through with them.” 

“But though I felt rather unhappy about the situation, I didn’t say any- 
thing since after all Erdos was trying to do something different from what 
I was interested in. 

In spite of this, I became ... rather concerned that Erdos was working on 
these things . . . 

I, therefore, started very feverishly to work on my own ideas. On Piiday 
evening Erdos had his proof ready (that 1 ) and he told it to me. 

On Sunday afternoon I got my first proof of the prime number theorem. 

I was rather unsatisfied with the first proof because it was long and indi- 
rect. After a few days (my wife says two) I succeeded in giving a different 
proof.” 

Quote from Erdos’s paper, “On a new method in elementary number theory which leads 
to an elementary proof of the prime number theorem,” Proc. Nat. Acad. Scis. 1949: 

“ Using (1) (fundamental formula) I proved that — > 1 as 7z — ^ oo. In 
fact, I proved the following slightly stronger result: To every e there exists 
a positive 6{e) so that for x sufficiently large we have 

7t(x(1 + e)) - 7t{x) > S{e)x/ log(x) 

where 7r(x) is the number of primes not exceeding x. 

I communicated this proof to Selberg, who, two days later . . . deduced 
the prime number theorem.” 

Recently, Selberg sent me a letter which more precisely specifies the actual dates of events. 

Quote from Selberg’s letter to D. Goldfeld, January 6, 1998: 

“July 14, 1948 was a Wednesday, and on Thursday, July 15 I met Erdos 
and heard that he was trying to prove ^ 1. I believe Turan left the 
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next day (Friday. July 16), at any rate whatever lecture he had given (and 
I had not asked him to give one!) he had given before my return, and he 
was not present nor played any part in later events. Friday evening or it 
may have been Saturday morning, Erdos had his proof ready and told me 
about it. Sunday afternoon (July 18) I used his result (which was stronger 
than just 1, he had proved that between x and x(l H- S) there are 

more than c((5)y^^^ primes for x > Xq{S), the weaker result would not 
have been sufficient for me) to get my first proof of the PNT. I told Erdos 
about it the next morning (Monday, July 19). He then suggested that we 
should talk about it that evening in the seminar room in Fuld Hall (as 
I thought, to a small informal group of Chowla, Straus and a few others 
who might be interested).’' 

In the same letter Selberg goes on to dispute Straus’ recollection of the events. 

“Turan’s lecture (probably a quite informal thing considering the small 
group) could not have been later than July 14, since it was before my 
return. Straus has speeded up events; Erdos told me he was trying to 
prove — » 1 on July 15. He told me he had a proof only late on July 
16 or possibly earlier the next day. Straus’ quote is also clearly wrong for 
the following reasons; first, I needed more than just > 1 for my first 

proof of the PNT, second, I only saw how to do it on Sunday, July 18. 

It is true, however, as Erdos’ and Straus’ stories indicate, that when I first 
was told by Erdos that he was trying to prove > 1 from my formula, 

I tried to discourage him, by saying that I doubted whether the formula 
alone implied these things. I also said I had constructed a counterexample 
showing that the relation in the form 



f(x)logx + f (y) df{t) = 2a: log a; + (0(x)) 

by itself does not imply that f(x) ~ x. It was true, I did have such an 
example. What I neglected to tell that in this example f(x) (though posi- 
tive and tending to infinity with x) was not monotonic! This conversation 
took place either in the corridor of Fuld Hall or just outside Fuld Hall so 
without access to a blackboard. This attempt to throw Erdos off the track 
(clearly not succeeding!) is somewhat understandable given my mood at 
the time. 

Quote from Selberg’s Paper, “An elementary proof of the prime-number theorem,” Annals 
Math. 1949 

“Prom the Fundamental Inequality there are several ways to deduce the 
prime number theorem ... The original proof made use of the following 
result of Erdos — > 1. Erdos’s result was obtained entirely independent 

of my work.” 
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Selberg's first proof that the prime number theorem followed from the fundamental 
formula is given both in [E] and [S]. The crux of the matter goes something like this. We 
may write the fundamental formula in the form 

^ ^ ^ 2 + of-4^V 

X x/p plog(x) \log(a:) / 



Recall that a and A are the limit inferior and limit superior, respectively, of • 

Now, choose x large so that is near A. Since a + A = 2, it follows from the 
fundamental formula and 

log(p) ^ ^ 

;^p\og{x) 



that must be near a for most primes p < x. If <S denotes the set of exceptional 

primes, then we have 




pGcS 



iQg(p) _ 

P 



Now, choose a small prime q ^ S such that is near a. Rewriting the fundamental 

formula with x replaced by x/q, the same argument as above leads one to conclude that 
near A for most primes p < x/ q. \i follows that {){x/p) ^ ax/p for most primes 
p < X and that i3{x/pq) ^ Ax/pq for most p < x/q. A contradiction is obtained (using 
Erdos’s idea of nonoverlapping intervals) unless a = A = 1. 



The Erdos-Selberg dispute arose over the question of whether a joint paper (on 
the entire proof) or seperate papers (on each individual contribution) should 
appear on the elementary proof of the PNT. 



August 20, 1948: Quote from a letter of Selberg to Erdos. 

‘AVhat I propose is the only fair thing: each of us can publish what he has 
actually done and get the credit for that, and not for what the other has 
done. 

You proved that 



lim ^ = 1. 

n — oo p,, 

I would never have dreamed of forcing you to write a joint paper on this 
in spite of the fact that the essential thing in the proof of the result was 
mine.” 

“Since there can be no reason for a joint paper, I am going to publish my 
proof as it now is. I have the opinion, . . . that I do you full justice by 
telling in the paper that my original proof depended on your result. 
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In addition to this I offered you to withliold iny proof so your theorem 
could be published earlier (of course then without mentioning PNT). 

I still offer you this. . . 

If you don’t accept this I publish my proof anyway.” 

Sept. 16, 1948: Quote from Selberg’s letter to Weyl. 

“when I came to Syracuse I discovered gradually through various sources 
that there had been made quite a publicity around the proof of the PNT. I 
have myself actually mentioned it only in one letter to one of my brothers 

Almost all the people whom the news had reached seemed to attribute the 
proof entirely or at least essentially to Erdos, this was even the case with 
people who knew my name and previous work quite well.” 

Quote from E.G. Straus’ , “The elementary proof of the prime number theorem.” 

“In fact I was told this story (I forget by whom) which may well not be 
true . . . When Selberg arrived in Syracuse he was met by a faculty 
member with the greeting: ” 

Have you heard the exciting news of what Erdos and some Scandinavian 
mathematician have just done? ” 

Quote from Selberg’s letter to D. Goldfeld, January 6, 1998: 

“This is not true. What I did hear shortly after my arrival were some 
reports (originating from the Boston-Cambridge area) where only Erdos 
was mentioned. Later there were more such reports from abroad.” 

Sept. 20, 1948: We quote from a second letter of Selberg to Erdos. 

“I hope also that we will get some kind of agreement. But I cannot accept 
any agreement with a joint paper. 

How about the following thing. You publish your result, I publish my 
newest proof, but with a satisfactory sketch of the ideas of the first proof 
in the introduction, and referring to your result. I could make a thorough 
sketch on 2 pages, I think, and this would not make the paper much longer. 

If you like, I could send you a sketch of the introduction. 

I have thought to send my paper to the Annals of Math. , they will certainly 
agree to take your paper earlier.” 

Sept. 27, 1948: Quote from Erdos’s letter to Selberg. 

“I have to state that when I started to work on — > 1 you were very 

doubtful about success, in fact stated that you believe to be able to show 
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that the FUND. LEMMA does not imply the PNT (prime number theo- 
rem) . 

If you would have told me about what you know about a and A, I would 
have finished the proof of PNT on the spot. 

Does it occur to you that if I would have kept the proof of ^ 1 to 

myself (as you did with a -\- A = 2) and continued to work on PNT ... I 
would soon have succeeded and then your share of PNT would have only 
been the beautiful FUNDAMENTAL LEMMA. 

Sept. 27, 1948: Quote from Erdos’s letter to Selberg. 

“I completely reject the idea of publishing only 

lim ^ = 1. 

n-^oo 

and feel just as strongly as before that I am fully entitled to a joint paper. 

So if you insist on publishing your new proof all I can do is to publish our 
simplified proof, giving you of course full credit for your share (stating that 
you first obtained the PNT, using some of my ideas and my theorem). 

Also, I will of course gladly submit the paper to Weyl first, if he is willing 
to take the trouble of seeing that I am scrupulously fair to you. 

Quote from E.G. Straus: “The elementary proof of the prime number theorem.” 

“It was Weyl who caused the Annals to reject Erdos’s paper and pub- 
lished only a version by Selberg which circumvented Erdos’s contribution, 
without mentioning the vital part played by Erdos in the first elementary 
proof, or even the discovery of the fact that such a proof was possible.” 

Quote from Selberg’s letter to D. Goldfeld, January 6, 1998: 

“This is wrong on several points, my paper mentioned and sketched in 
some detail how Erdos’s result played a part and was used in the first 
elementary proof of PNT, but that first proof was mine as surely as Erdos’ 
result was his. Also the discovery that such a proof was possible was surely 
mine. After all, you don’t know that it is possible to prove something until 
you have done so!” 

Excerpt: Handwritten Note by Erdos: 

“It was agreed that Selberg’s proof should be in the Annals of Math., 
mine in the Bulletin. Weyl was supposed to be the referee. To my great 
surprise Jacobson the referee. . . The Bulletin wrote that the referee does 
not recommend my paper for publication. 

I immediately withdrew the paper and planned to publish it in the JLMS 
but . . . had it published in the Proc. Nat. Acad.” 
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Feb. 15, 1949: Quote from H. Weyl’s letter to Jacobson 

“I had questioned whether Erdos has the right to publish things which 
are admittedly Selberg’s. . . I really think that Erdos’s behavior is quite 
unreasonable, and if I were the responsible editor I think I would not be 
afraid of rejecting his paper in this form. 

But there is another aspect of the matter. It is probably not as easy as 
Erdos imagines to have his paper published in time in this country if the 
Bulletin rejects it. . . So it may be better to let Erdos have his way. No 
great harm can be done by that. Selberg may feel offended and protest 
(and that would be his right), but I am quite sure that the two papers - 
Selberg’s and Erdos’s together - will speak in unmistakable language, and 
that the one who has really done harm to himself will be Erdos.” 

Quote from E.G. Straus: “The elementary proof of the prime number theorem.” 

“The elementary proof has so far not produced the exciting innovations in 
number theory that many of us expected to follow. So, what we witnessed 
in 1948, may in the course of time prove to have been a brilliant but 
somewhat incidental achievement without the historic significance it then 
appeared to have.” 

Quote from Selberg’s letter to D. Goldfeld, Januar}^ 6, 1998: 

“With this last quote from Straus, I am in agreement (actually 1 did not 
myself expect any revolution from this). The idea of the local sieve, how- 
ever, has produced many things that have not been done by other meth- 
ods.” 

Remark: To this date, there have been no results obtained from the elementary proof of 
the PNT that cannot be obtained in stronger form by other methods. Other elementary 
methods introduced by both Selberg and Erdos have, however, led to many important 
results in number theory not attainable by any other technique. 

Dec. 4, 1997: Letter from Selberg to D. Goldfeld. 

“The material I have is nearly all from Herman Weyl’s files, and was given 
to me probably in 1952 or 1953 as he was cleaning out much of his stuff 
in Princeton, taking some to Zurich and probably discarding some. The 
letters from Weyl to myself was all that I kept when I left Syracuse in 
1949, all the rest I discarded. Thus there are gaps. Missing is my first 
letter to Erdos as well as his reply to it. . . 

I did not save anything except letters from Weyl because 1 was rather 
disgusted with the whole thing. I never lectured on the elementary proof 
of the PNT after the lecture in Syracuse, mentioned in the first letter to 
Herman Weyl. However, I did at Cornell U. early in 1949 and later at 
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an AMS meeting in Baltimore gave a lecture with an elementary proof 
of (using the notation of Beurling generalized primes & integers) the fact 
that if 

then 

Beurling has the same conclusion if 

N{x) = Ax + O f ^ ^ , 

with Q' > |. I never published this. 

Erdos of course lectured extensively in Amsterdam, Paris, and other places 
in Europe. After his lecture in Amsterdam, Oct. 30, 1948, v.d. Corput 
wrote up a paper, Scriptum 1, Mathematisch Centrum, which was the first 
published version!” 

References 

[A-B-G] K.E. Aubert, E. Bombieri, D. Goldfeld, Number theory, trace formulas and 
discrete groups, symposium in honor of Atle Selberg, Academic Press Inc. Boston (1989). 

[A— S] N. Alon, J. Spencer, The probabilistic method, John Wiley & Sons Inc., New York 
(1992). 

[B] H. Bohr, Address of Professor Harold Bohr, Proc. Internal. Congr. Math. (Cam- 
bridge, 1950) vol 1, Amer. Math. Soc., Providence, R.I., 1952, 127-134. 

[Ch] J. Chen, On the representation of a large even integer as the sum of a prime and the 
product of at most two primes, Sci. Sinica 16 (1973), 157-176. 

[C— G] F. Chung, R. Graham, Erdos on graphs: his legacy of unsolved problems, A.K. 
Peters, Ltd., Wellesley, Massachusetts (1998). 

[C] L.W. Cohen, The annual meeting of the society. Bull. Amer. Math. Soc 58 (1952), 
159-160. 

[D] H.G. Diamond, Elementary methods in the study of the distribution of prime numbers. 
Bull. Amer. Math. Soc. vol. 7 number 3 (1982), 553-589. 

[E] P. Erdos, On a new method in elementary number theory which leads to an elementary 
proof of the prime number theorem, Proc. Nat. Acad. Scis. U.S.A. 35 (1949), 374-384. 

[G] D. Goldfeld, The Erdos-Selberg dispute: file of letters and documents, to appear. 

[HI] J. Hadamard, Etude sur les proprietes des fonctions entieres et en particulier d’une 
fonction consideree par Riemann, J. de Math. Pures Appl. (4) 9 (1893), 171-215; reprinted 
in Oeuvres de Jacques Hadamard, C.N.R.S., Paris, 1968, vol 1, 103-147. 




192 



[H2] J. Hadamard, Sur la distribution des zeros de la fonction C(s) et ses consequences 
arithmetiques, Bull. Soc. Math. Prance 24 (1896), 199-220; reprinted in Oeuvres de 
Jacques Hadamard, C.N.R.S., Paris, 1968, vol 1, 189-210. 

[Ho] P. Hoffman, The man who loves only numbers, The Atlantic, November (1987). 

[I] A.E. Ingham, Review of the two papers: An elementary proof of the prime-number 
theorem, by A. Selberg and On a new method in elementary number theory which leads 
to an elementary proof of the prime number theorem, by P. Erdos. Reviews in Number 
Theory as printed in Mathematical Reviews 1940-1972, Amer. Math. Soc. Providence, RI 
(1974). See N20-3, Vol. 4, 191-193. 

[Lai] E. Landau, Handbuch der Lehre von der Verteilung der Primzahlen, Teubner, Leipzig 
(1909), 2 volumes, reprinted by Chelsea Publishing Company, New York (1953). 

[La2] E. Landau, Uber den Wienerschen neuen Weg zum Primzahlsatz, Sitzber. Preuss. 
Akad. Wiss., 1932, 514-521. 

[Lei] A.M. Legendre, Essai sur la theorie des nombres, 1. Aufl. Paris (Duprat) (1798). 

[Le2] A.M. Legendre, Essai sur la theorie des nombres, 2. Aufl. Paris (Courcier) (1808). 

[S] A. Selberg, An elementary proof of the prime-number theorem, Ann. of Math. (2) 
50 (1949), 305-313; reprinted in Atle Selberg Collected Papers, Springer-Verlag, Berlin 
Heidelberg New York, 1989, vol 1, 379-387. 

[Syll] J.J. Sylvester, On Tchebycheff ’s theorem of the totality of prime numbers comprised 
within given limits, Amer. J. Math. 4 (1881), 230-247. 

[Syl2] J.J. Sylvester, On arithmetical series. Messenger of Math. (2) 21 (1892), 1-19 and 
87-120. 

[Tchl] P.L. Tchebychef, Sur la fonction qui determine la totalite des nombres premiers 
inferieurs d une limite donnee, Memoires presentes a I’Academie Imperiale des Sciences de 
St.-Petersbourg par divers Savants et lus dans ses Assemblees, Bd. 6, S. (1851), 141-157. 

[Tch2] P.L. Tchebychef, Mem,oire sur les nombres premiers, J. de Math. Pures Appl. (1) 
17 (1852), 366-390; reprinted in Oeuvres 1 (1899), 49-70. 

[VP] C. J. de la Vallee Poussin, Recherches analytiques sur la theorie des nombres premiers, 
Ann. Soc. Sci. Bruxelles 20 (1896), 183-256. 

[Wl] N. Wiener, A new method in Tauberian theorems, J. Math. Physics M.I.T. 7 (1927- 
28), 161-184. 

[W2] N. Wiener, Tauberian theorems, Ann. of Math. (2) 33 (1932), 1-100. 




ADDITIVE BASES REPRESENTATIONS 
AND THE ERDOS-TURAN CONJECTURE 



G. Grekos, L. Haddad, C. Helou* , J. Pihko 



Abstract. We give a lower bound to the maximal number of representations by an additive 
basis of the natural numbers, in conjunction with a celebrated conjecture of Erdos and Turan. 



Introduction 

Denote by N = {0, 1, 2, ... } the set of natural numbers and consider a subset A of N. 
The number r(A, n) of representations of an element n of N by is the number of ordered 
pairs (a,b) e A X A such that a -h b = n. We will say that >1 is a basis of N if r(A, n) > 1 
for all n G N. Our main objective is the exploration of the following conjecture of P. Erdos 
and P. Turan [3], dating back to 1941. 

(ET): If ^ is a basis of N, then r(A, n) is unbounded, as n ranges through N. 

There seems to be relatively little work concerned with this conjecture. Originally, Erdos 
and Turan used some deep function theory to show that r{A,n) does not become constant 
for large n. But, in 1951, G. Dirac [1] proved this result by an elementary argument 
concerning the parity of r(A,n). In 1956, P. Erdos and W. Fuchs [4] established, using 
Fourier series, that it is impossible to have k) = cn + o(n^/^(log They 

also showed that if A = {ai < U 2 < • • • < Un < • • • } satisfies the condition an < Kn^ for 
some constant K > 0 and all n G N (which is true of every basis of N), then for any c > 0, 
one has limsup„_^ ^ ELo(^(^. ~ 0. They further asserted the existence of a 

sequence A satisfying the same condition, for which limsup„_^oQ ^ 

In 1990, I. Ruzsa [8] confirmed this assertion, by constructing a basis A of N for which 
Ylk=o'’^(A, k)"^ = 0(n). In 1988, M. Dowd [2] gave a finite form of (ET) in N, equivalent 
to one of our formulations. His proof, using graph theory, was recently clarified and 
generalized by M. Nathanson [7]. Dowd also indicated that the validity of (ET) in Z 
implies its validity in N. But, in 2002, Nathanson [6] showed that (ET) is not valid in Z, 
by constructing arbitrarily sparse bases A of Z for which r(A, n) is at most 2 for all n G Z. 

Here, we will introduce some functions, defined in terms of a variable bound x in N 
and of the traces of all bases A of N on the interval [0, x], involving the values of r(A, n). 
This allows for equivalent formulations of the Erdos- Turan conjecture susceptible of partial 
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quantitative verifications. In particular, we deduce that for any basis A of N, the numbers 
r{A,n) must at least take values > 6. For more ample details, we refer to [5]. 

§1 Some functions describing finite traces of bases 

For G N and P C N, we write P[x] = P D [0, rc] = {p G P : p < a:}, and we set 
p(P,x) = max{r(P, n) : n G N[a;]}. We also set s(P) = sup{r(P, u) : n G N}; this is an 
element of N N U {oo}. 

The following properties are immediate but useful: If p G N is such that x < y, then 
p{p, x) ^ p{P^y)\ ^1^4 if 4 C P, then p{A^x) < p(P,x). Moreover, p(P, .t) = p{P[x]^ x). 
Also, s(P) = lim p(P, .t). 

x->oo 

A set P will be called a basis of N[x] if P C N[a;] C P T P, where P -f- P == {p + g : 
(p? q) ^ P ^ P}] whereas P is a basis of N if P + P = N. The set of all bases of N[a:] will 
be denoted by B{x), and that of all bases of N by P(N). Naturally, P G P(N) if and only 
if P[x] G B(x) for every a; G N. 

The functions p and r, from N into N, are defined by p{x) = min{p(P, a;) : P G B{x)} 
and r{x) = min{5(P) : P G B{x)}. 

Lemma 1. 

(1) The functions p and r are increasing. 

(2) For every x we have p{x) < r[x) < p{2x). 

Proof. 

(1) Let x,y e N be such that x < y. For any Q G P(p), we have Q[x] G B[x) and 
therefore p(Q,y) > p{Q,x) = p{Q[x],x) > p{x) and s{Q) > 5(Q[a:]) > r{x). Hence 
p{y) = min{p(Q,p) : Q G B{y)} > p[x) and r(p) = min{5(Q) : Q G B{y)} > r{x). 

(2) The inequality p{x) < r{x) follows from the obvious fact that p(P, x) < p(P, 2a;) = 

s{P), for all P G B{x). Moreover, for any Q G P(2a;), since Q[x] G B{x), we have r{x) < 
5(Q[a:]) = p{Q[x],2x) < p{Q,2x). Hence r{x) < mm{p(Q,2x) : Q G B{2x)} = p{2x). □ 

A third significant function is : N* — >■ N*, where N* = N \ {0}. It is defined by 
a{n) = min{5(P) : P G P(#n)}, where P(#n) - {P C N : |P| = n and P G P(maxP)}. 
A finite, non-empty subset P of N such that P G P(maxP) will simply be called a finite 
basis, so that B{#n) is the set of all finite bases having exactly n elements. Furthermore, 
we will call successor of a finite basis P every finite basis Q obtained by adjoining to P 
an element q > maxP, so that Q = P U {g'} and Q G B{q). Clearly, Q = P U is a 
successor of P if and only if maxP -h 1 < ^ < h, where h — min(N \ (P + P)). It is also 
easy to see that B{#{n + 1)) consists exactly of the successors of the elements of B{#n). 

Lemma 2. 

(1) The function a is increasing. 

(2) For any n G N*, we have 1 < cr(n) < n. 

Proof. 

(1) For any n G N* and any Q G B{#{n H- 1)), there is some P G B{#n) such that 
Q is a successor of P, namely P = Q \ {maxQ}. Then s{Q) > s{P) > a(n). Hence 
cr(n T 1) = min{s(Q) : Q e B{#{n + 1))} > a{n). 
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(2) For any finite subset P of N and any x G N, the number r(P, n), of ordered pairs 
(p, 71 — p) such that p ^ P, cannot exceed the cardinality |P| of P. Therefore s{P) < 

In particular, if P E J3{#n)^ then s(P) < \P\ = n. Hence a{n) < n. Moreover, by (1) 
above, (r{n) > cr(l) — 1. □ 

Lemma 3. For any x E N, if P E B{x), then |P| > (\/Sx + 9 — l)/2. 

Proof. Let n == |P| and P = {pi, . . . ,pn}, where pi < • • • < Pn- Since N[a:] C P + P, we 
have |N[x]| = x P 1 < |P + P|. Moreover, P + P — {pi + Pj : 1 < 7 < i < ti} and therefore 
|P + P| < j — + l)/2. Thus x-\-l < n{n + l)/2, i.e. n‘^ + n — 2{x + 1) > 0, which 

implies that n > (— 1 + \/8rr + 9)/2 (the positive root of the quadratic equation). □ 
Corollary. For any rz E N*, we have r{n — 1) < cr{7i) < r{n{n + l)/2 — 1). 

Proof. For any P E B{#n)^ since P E P(maxP), we have s{P) > r(maxP); and since 
P has n elements, we have maxP > n — 1, so that, r being an increasing function, 
s{P) > r(maxP) > r(n — 1). Hence o-{n) = min{5(P) : P E B{#n)} > r{n — 1). 

On the other hand, if a: = n{n + l)/2 — 1 then, by Lemma 3, for any P E B(x), we 
have |P| > n. Thus, a being an increasing function, s{P) > cr{\P\) > a{n). Hence 
T{n{n -f l)/2 - 1) = min{5(P) : P E B{x)} > a{n). □ 

The problem considered here can be more generally stated as the determination of the 
element A = inf{5(P) : P E B{N)} of N. Indeed, the Erdos- Turan conjecture amounts to 
the assertion that A = cxd. 

Lemma 4. We have lim p{x) = lim r{x) = lim a{x) < A. 

X— >oo x—>oo x—^oo 

Proof. Since the functions p, r and a are increasing, they all have limits in N, as a; — > oo. 
The first equality then follows from Lemma 1, (2), and the second equality follows from 
the above Corollary. Moreover, for any a: E N and any P E P(N), since P[x] E B{x), we 
have r{x) < 5(P[a;]) < s{P). Hence r{x) < A, for all a; E N, and thus lim r{x) < A. □ 

x—yoo 

§2 Equivalent formulations of the Erdos-Turan conjecture 

We will need the following important set-theoretic notions and results. 

Definition. By an infinite family of subsets ofN, we will mean any family V = (Pi)i^j 
of s7ibsets of N whose index set I is infinite. Furthermore, a subset A of N will be called a 
diagonal of such a family V, if for any 72 E N, there are infinitely many indices i ^ I such 
that Pi has the same trace as A on the interval [0,7z], i.e. such that A[n] = Pi[n]. 

For instance, if P = (Pi)ig/ is an infinite increasing, for inclusion, sequence of subsets 
of N (i.e. 7 C N and for i,j E 7, if z < j then Pi C Pj), then Uie/ diagonal of 

P. But in general, it is not obvious whether an arbitrary infinite family P has a diagonal. 
As a matter of notation, for any set X, we denote by S{X) the set of all subsets of X. 

Lemma 5 (The Diagonal Lemma). Every infinite family V = {Pi)i^j of subsets ofN 
has at least one diagonal A C N. 

Proof. We construct, by induction on n, an increasing sequence of subsets An of N and 
a decreasing sequence of infinite subsets 7„ of 7 such that for any n E N and any z E 7„, 
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we have — Pi[n\. Then we let A — IJugn verify that A is a diagonal of the 

family V. 

For n = 0, let Fq : I — > <S({0}) be the map defined by Fq( 0 = FjO], for all z G /. Since 
I = Fj"^(0) U F(^^({0}) is an infinite set, then, one at least of the two sets 0 or {0}, that 
we call Aq^ has an infinite preimage /q = Fq^{Aq). For n G N*, we assume constructed 
2n subsets Aq C • • • C An-i C N and In-i C • • • C /q C /, such that In-i is infinite and 
Aj = Pi[j] for all i G Ij and ^ < j < n — 1. Let F„ : 7„_i — > <S(N[n]) be the map defined 
by Fr^{i) = Pi[n], for all i G In-i- Since <S(N[n]) is finite and In-i = (JxeS{N[n]) 
is infinite, there exists an element An G «S(N[ti]) such that = F~^{An) is an infinite 
subset of In-i- Since /„ C /n-i, then, for n G /n, we have An-i = Pi[n - 1] C Pi[n] = An- 
This completes the construction by induction. 

Now, letting A = UfcGN^fc’ n G N. But since the 

sequence {A^) is increasing, for 0 < A: < n, we have Ak[n] C An[n] = Moreover, for 
k > n, since Ik C In, for any z G 7^, we have Ak[n] = {Pi[k])[n] = Pi[n] = An- Hence 
A[n] = An = Pi[n], for all n in N and all z in the infinite set In- Thus A is a diagonal of 
the family V. □ 

Corollary 1. Let V = (Fi)ie/ infinite family of subsets ofN, with I C N, and let 

A be a diagonal of this family. 

(1) If Pi ^ B{i) for all i e I, then A G B{N). 

(2) Ifs{Pi)<s, for some s G N and all i E I , then s(A) < s. 

(3) If Pi G B{i) and s{Pi) = r(z) for all i ^ I, then A G B{N) and we have s{A) = 
lim t(x) = A. 

X— f oo 

Proof (1) and (2). For every n G N, there is an infinite subset 7„ of 7 such that for all 
z G In, we have A[n] = Pi[n] and therefore r{A,n) = r{Pi,n). Thus if Pi G B{i) (resp. 
s{Pi) < s) for all i e I, then, choosing z > n in 7„, we get r{A,n) = r{Pi,n) > 1 (resp. 
r{A,n) = r{Pi,n) < s{Pi) < s). Since this holds for every rz G N, we conclude that 
A G F(N) (resp. s{A) < s). 

(3) If Pi G B{i) and s{Pi) = r{i), for all z G 7, then, by (1), A G B(N). Moreover, for any 
n G N, there is an infinite subset In of 7 such that for all z G 7„, we have A[n] — Pi[n] and 
therefore r{A,n) = r(Fi,n) < s(Pi) = r(z) < lim r(x), since r is an increasing function. 

X— >oo 

It follows that s(A) < lim r{x). On the other hand, by Lemma 4 and the definition of A, 

x—^oo 

we have lim r{x) < A < s{A). Hence the equalities. □ 

X— >oo 

Corollary 2. We have lim p{x) = lim t{x) = lim a{x) = A. 

x—yoo X— foo X— ^oo 

Proof. For every z G N, choose from the finite set B(i) an element Pi at which the map 
P i-> s(P) attains its minimum r(z). Then the family V = {Pi)^^-^ satisfies the conditions 
Pi G B{i) and s{Pi) = r(z) for all z G N. Hence, by Corollary 1, (3), lim r(x) = A. The 

x—¥oo 

other equalities result from Lemma 4. □ 

Theorem 1. The following statements are equivalent: 

(ET): If A is a basis ofN, then s(A) = oo; i.e. A = oo. 

(ETp): lim p{x) = oo. 
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(ETt): lim r{x) = oo. 

x—^oo 

(ETa): lim cr(x) = oo. 

a: -400 

Proof. This is an immediate consequence of the above Corollary 2. □ 

§3 On the growth of the functions p, r and a 

We will need an auxiliary function a : N — > N. For G N and for a subset P of 
N, we first set a{P,x) — max{r(P, n) + r{P,n + a; + 1) : n € N[a;]}; we then define 
a{x) = mm{a{P,x) : P G 

Lemma 6. For any x gN, we have p{x) < t{x) < a{x) < min(2r(a;) , [{x S) / 2]) , where 
[r] denotes the integral part of the real number r. 

Proof. For any P C N, we have a{P,x) = max{r(P, n) + r{P^n + a; + 1) : n G N[a;]} > 
r(P,fc), for 0< k < 2x 1. It follows that a{P^x) > p(P, 2a; + 1). Moreover, a{P^x) < 
max{r(P, 7^) : n G N[a;]} + max{r(P, n + x + 1) : n G N[x]} < /a(P, a;) + /a(P, 2a; + 1) < 2s{P). 
In particular, if P C N[a;], then p(P, 2a; + 1) = s(P) and therefore s{P) < a{P^x) < 2s{P). 
Taking the minimum, as P ranges through B{x), of all sides in the latter inequalities, we 
get t{x) < a{x) < 2r{x). In addition, by Lemma 1, p{x) < r{x). This yields all the 
desired inequalities except one; so there only remains to show that a{x) < [(x + 3)/2]. 

Now, let m = [(x + l)/2] and A = N[m]. Then A G B{x) and thus a{x) < a{A,x). 
Moreover, a{A,x) < max{r(A,n) : n G N[x]} = p{A,x). Indeed, ifn > 1, then n + x + 1 > 
2m and thus r{A,n + x + 1) =0; while if n = 0, then r(A, 0) + r{A,x + 1) = r(A, x), 
since both sides are equal to 1 or 2 according as x = 2m or x = 2m — 1 respectively. 
Furthermore, s{A) = r{A, m) = m + 1, since it can be easily checked that r(A, n) — n 1 
if 0 < n < m and r(A, n) = max(2m — n + 1,0) ifn>m + l. Therefore a{x) < a{A^ x) < 
p{A,x) < s{A) = m + 1 — [(x -t- 3)/2], which completes the chain of inequalities. □ 

Another concept that we need is that of generating power series. For every subset P of N, 
there is an associated formal power series fp{X) = , where 

x(P, .) is the characteristic function of P, defined by x(P, n) = 1 if n G P and x{P^ n) = 0 if 
n G N\P. The square of this series gp{X) = /p(X)^ = is the generating 

series of the sequence (r(P, n))neN, since r(P, n) — x(P^ k)x{P, n — k), for any n G N. 

For instance, \i A = N[m], where m G N, then fA{X) — — (1 ~ X”^+^)/(I — X) 

and therefore gA{X) = {I - X'^+^f / (I - Xf - (l - 2A-+' + + = 

-h 1)A” + Z)n=m+i(2^ - n + l)x^, which gives the values of r(A, n) for all n G N 
as noted in the proof of Lemma 6. 

Lemma 7. Let A, B he two subsets ofN and d G N be such that A is finite and d > max A. 
Let C = A -G d ^ B — {a A db : a G A^h G B}. Then 

(1) fc { X ) = fA { X ) fB { X ^). 

(2) For any n G N, there exist unique integers g, e G N such that n = dq A e and 
0 < e < d, and we have r((7, n) = r(A, e)r{B, q) -f r(A, d + e)r(P, ^ — I). 

(3) For any x G N, we have p(C^ x) < a(A, d — l)p(P, [x/d]) 
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Proof. (1) We have C = [JaeAi ^~^ ^ where the sets aPd^B {a} Pd^B are two by 
two disjoint. Indeed, if for a, a' G with a < a', there is some c G {a-\-d*B)n{a' Pd^B), 
then c = a + db = a' -j- db', with 6,6' G B, and therefore 0 < a' — a = d{b — 6') < 
a' < max^ < d, which is only possible if 6 — 6' = 0, i.e. a = a'. It follows that 
fc{X) = Ea6A E6eB = Eae.1 ^“/b(^") = fA{X)fB{X^). 

(2) Squaring the relation in (1), we get gc{X) = gA{X)gB{X'^), i.e. n)X'^ = 

(Cll^=Qf'{A,n)X'^) (Xl^o Identifying the coefficients of X^ on both sides, 

for n G N, we get r{C,n) = where the summation is over all (j^k) G 

N X N such that j dk = n. The latter sum can be restricted to 0 < 7 < 2m, where 
m = max A, since r(A,j) = 0 for j > 2m. Now, for a given n G N, the existence and 
uniqueness of q and e result from the Euclidean division of n by d. Since n = dg + e, with 
0 < e < d, the condition j dk = n amounts to j = d[q — /c) + e, with the restriction 
0 < d(g — k) -\- e < 2m < 2d., so that 0 < g — < 2, i.e. either k = j = e or 

k = q — 1, j = dp e. Therefore r(C, n) = r{A, e)r(J5, g) + r{A, d -f e)r{B, q — 1). 

(3) Let X = dup where G N satisfy 0 < < d and are uniquely determined by 

Euclidean division. Then for 0 < n < x, similarly expressed by n = dg -f e, with g, e G N 
and 0 < e < d, we have 0 < g < n, so that r(B,q— 1) and r{B, g) are < p{B, u). Therefore, 
by (2), r(C, n) = r{A, e)r{B, g) + r{A, d -f e)r{B, g - 1) < {r{A, e) P r{A, d P e))p{B, u), 
for all 0 < n < X. Hence p(C, x) < max{r(yl, e) + r(A, dpe):0<e<d — l}p{B, u) = 
a{A,d — l)p{B,u), which is the stated inequality, since u = [x/d]. □ 

Lemma 8. Let x, ?/ G N, A e B{x) and B G B{y), and let C = A P {x P 1) * B . Then 

(1) C G B{xy P xPy). 

(2) p(C,xy P X Py) < a{A,x)p{B,y). 

(3) s{C) <a{A,x)s(B). 

Proof. 

(1) Let d = x + l and 2; = xypxpy. Then C — Apd^B and 2: = dypx, with 0 < a: < d; 
furthermore 0 < max A < x < d. So, by Lemma 7, for any 0 < n < 2, if n = dg -f e with 
0 < e < d, then 0 < g < g and r(C, n) = r(A, e)r{B, q) -f r{A, d P e)r{B, g — 1). Since 
A G B{x) and e < d — 1 = x, then r{A,e) > 1 and since B G B{y) and q < y, then 
r{B,q) > 1. Therefore r{C,n) > r{A,e)r{B^q) > 1, for all 0 < n < 2, i.e. N[2] C. C P C. 
Moreover, C = Ap d^ B <Z N[x] + (a: + 1) * N[g] C N[z]. Hence C G B{z). 

(2) By Lemma 7, and since d — 1 = a; and [z/d] = 2/, we have p{C,z) < a{A,x)p{B,y). 

(3) Since C C N[z], we have s(C) = p{C^2z). But, by Lemma 7, we have p{C,2z) < 

a{A^x)p{B,[2z/d]) < a{A,x)s{B). Hence s{C) < a{A^x)s{B). □ 

Theorem 2. For any x,y eN, we have 

(1) p{xyPxPy)<a{x)p{y). 

(2) r{xyPxPy) <a{x)T{y). 

Proof. Let z = xy P x P y. By Lemma 8, for any A G B{x) and any B G B{y), there exists 
C G B{z) satisfying p{z) < p{C,z) < a{A,x)p{B^y) and r{z) < s{C) < a{A,x)s{B). 
Hence p{z) < min{Q;(yl, a:) : A G B{x)} • mm{p{B,y) : B G B{y)} = a{x) • p{y) and 
similarly r{z) < min{o;(A,a;) ; A G B{x)} • min{s(B) : B G B{y)} — a{x) • T{y). □ 
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Corollary. For any x,y eN, we have 

(1) p{xy + X -\-y) < 2r{x)p{y), and also p{xy x y) < [{x 3)/2]p{y). 

(2) T{xy X -^y) < 2T{x)r{y), and also T{xy + x y) < [{x 3)/2]r{y). 

Proof. The inequalities follow immediately from Theorem 2 and Lemma 6. □ 

Proposition. For any x eW , if p{x + 1) > p{x), then r{x) > p{x). 

Proof Assume that p{x -f 1) > p{x). U B e B{x) is such that p{B,x) = p{x), then 
B G B{x -f- 1); for otherwise, we would have r{B,x + 1) = 0 and therefore C = B U 
{x + 1} would lie in B{x -h 1) and satisfy r(C,x 4- 1) = 2, so that p{x 4- 1) < p{C,x 4- 
1) = max{p{B,x),r{C,x-\-l)) = p{B,x) = p{x) < p{x P 1), in contradiction with the 
assumption. It follows that B £ B{x) is such that p{B^x) = p{x), then s(B) > p{x)] 
indeed, B being in B{x 4-1), we have s{B) > p{B,x 4- 1) > p{x 4- 1) > p{x), by the 
assumption. Therefore, for all B G B{x), we have s{B) > p{x)\ indeed, this was proved if B 
satisfies the condition / 9 (jB, x) = p(a:), and if it does not satisfy this condition we would have 
s{B) > p{B,x) > p{x). We thus conclude that t(x) = mm{s{B) : B G B{x)] > p{x). □ 

§4 Numerical results 

For a; G N, a basis B G B[x) is called p-optimal (resp. r-optimal) if p{B,x) — p{x) (resp. 
s{B) = t{x))\ also, if 5 G B{#x) is such that s{B) = a[x), it is called a cr-optimal basis 
of cardinality x. The set of p-optimal (resp. r-optimal) bases in B{x) is written C?{p,x) 
(resp. 0(r, j:)), and the set of ^r-optimal bases of cardinality x is written 0{a,x). To com- 
pute p{x), r{x) or cr(x), we have to determine at least one corresponding optimal basis. 
But the number of bases grows so rapidly with x that exhaustive searches are impossible 
for large x. For instance, |B(15)| = 8134, while |(9(p, 15)| = 155 and |(9(r, 15)| = 102; 
also, |B(18)| = 63910. On the other hand, |B(#10)| = 47098. As to examples of 
such bases, we mention B = {0,1,2,5,8,11}, which is in 0{p, 13) D 0{r, 13) PI 0{cr,6), 
and also B = {0,1,2,3,4,5,7,9,11,15,21,26,34,35,39,46,54,62,72,79,89,94,101,110, 
128, 137, 150, 153, 166, 182, 193, 206, 218}, which is in 0{p, 223) Pi 0(r, 223) P 0{a, 33). 

Here are some values of the three main functions studied above that were obtained by 
computer calculations, using the software Maple. 

The p function. 

p(0) = 1 

p{x) = 2 for 1 < a; < 5 

p{x) = 3 for 6 < a: < 12 

p(^x) = 4 for 13 < a: < 55 

p{x) = 5 for 56 < a; < 69 

p\x) == 6 for 70 < a: < 233; and p(234) > 6. 

The T function. 

r(0) = 1 

t(x) = 2 for 1 < s < 4 
t{x) = 3 for 5 < a; < 10 
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t{x) = 4 for 11 < X < 45 
t{x) = 5 for 46 < X < 59 
r{x) = 6 for 60 < a: < 223; and r(224) > 6. 

The (7 function. 

fr(l) = 1 

(j{x) = 2 for 2 < X < 3 

cr(x) = 3 for 4 < X < 5 

cr(x) = 4 for 6 < X < 12 

a-(x) -- 5 for 13 < x < 14 

<t(x) = 6 for 15 < X < 33; and fx(34) > 6. 

Theorem 3. We have A > 6; i.e. for any B G H(N), we have s{B) > 6. 

Proof. This results from Lemma 4, the fact that the function p, or r or a, is increasing, 

and from its highest calculated value, listed above. □ 
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Abstract 



Let A be a finite subset of Z^. Let h be a positive integer. Let hA be a sumset 
defined by 

k 

{hia-[ + • • • + h^ak | a,; G A, hj = h] 



where A: = |A|. It is found that the distribution of the elements of hA in the boundary 
region of the convex hull of hA exhibited a repeating pattern. In other words, if each 
side of the boundary region of the convex hull of hA is partitioned into h cells, for 
h sufficiently large, there exists a constant C and there exist a consecutive h — C 
congruent parallelograms such that the elements of hA in each parallelogram can be 
translated by a constant vector to obtain elements of hA in the next parallelogram. By 
counting the number of parallelograms and the cardinality of hA in each parallelogram, 
it can be found that the cardinality of hA in the boundary region is a linear function 
of h. 



1 Introduction 



Many studies have been done on the structure and the cardinality of sum of sets. In par- 
ticular, let h be a positive integer, and let A be a finite subset of Z", the structure and 
the cardinality of the /i-fold sumset of A, denoted by /lA, for sufficiently large h can be 
approximated by studying the convex hull of hA. 
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It was found by Nathanson that when is a set of integers, the structure of the /i-fold 
sumset of A consists of an interval of consecutive integers and the cardinality of hA is a 
linear function of h. It was found by Khovanskii that when is a finite set of lattice points 
in Z”, the structure of the h-fold sumset of A consists of a polytope such that all of the 
lattice points in the polytope are contained in hA. In other words, Khovanskii showed that 
there exists a positive real number p such that all lattice points in the convex hull of hA^ 
whose distance from the boundary is greater than or equal to p, belong to hA. By using the 
volume of the polytope to approximate the cardinality of hA, Khovanskii showed that the 
cardinality of hA is a function of /i”. 

The author examines the structure of sumset from a diflPerent perspective. Both Nathanson 
and Khovanskii studied the ’’core” structure of hA. In other words, Nathanson and Kho- 
vanskii studied the distribution of those elements of hA that constitute the interior lattice 
points of the convex hull of hA. The author will study instead the ’’boundary” structure 
of 7i.4. The focus is on the distribution of those elements of hA that is less than a given 
distance away from the boundary of the convex hull of hA. As shown by Khovanskii, the 
cardinality of the elements of hA in the ’’core” structure is a function of h'^. It is conjectured 
that the cardinality of these ’’boundary” elements of hA is This paper will examine 

the case where A is a finite subset of TA . The author will show that there is a repeating 
and consistent pattern in the ’’boundary” structure of hA and that the cardinality of these 
” boundary” elements is a linear function of h. 



2 Notation and example 



Let denote the convex hull of hA. When we speak of the boundary or the interior of 
hA, we mean the boundary or the interior of AhA- Let x,y,a,b be elements in R^. Let 
l{x,y) denote the line segment connecting the two points x and y. Let d{a,b) denote the 
distance between the two points a and b. Let the set 

{a e I d(a, l{x, y)) = p} 

denote the set of all elements a such that a is exactly p distance away from the line segment 
l{x,y). Similarly, the set 

{a € R^ I d{a,l(x,y)) < p} 

denotes the set of all elements a such that a is less than p distance away from the line 
segment l{x, y). 

If l{x, y) is a boundary line of A/^^, the set 

{a e At, A I d{a,l{x,y)) < p) 

is also referred to as the ’’boundary region.” The elements of hA in the boundary region are 
called the ’’boundary elements” of hA. 
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The following .simple example in 7 ? will illustrate the regularity in the distribution of the 
boundary elements of hA. In fact, the example is so simple that there is a regularity in the 
distribution of the elements of hA throughout the interior of A^a- 

Suppose A is a set of lattice points in such that A contains only three elements. Let 
A = {0,ai,a2}, where 0 = ( 0 , 0 ) and ai ^ ka2 for any real number k. Let h be a positive 
integer, then 

hA = {kiai + /C2a2 | /ci = 0 , 1 , . . . , /i and /c2 = 0 , 1 , . . . , /r — /ci} 

The cardinality of hA is 

= (A+2)(fe+i) ^ y 3ft 

' ' 2 2 2 

The boundary of A/,^ consists of three line segments: /(O, hai), l( 0 Jia 2 ), and l{haijia2)- 
Moreover, the boundary of A/j.^ consists of these elements of hA\ 

{kidi I /ci = 0, 1 , . . . , h-l}u{k 2 a 2 | /C2 = 1, • • - , /i-l}u{A:iai+A:2fl2 | A:i = 0 , 1, . . . , /i, k 2 = h-ki} 
Thus, 

\hA n boundary \ = 3 /i 

A more interesting problem is to be able to examine not just those elements that lie on 
the boundary of A^a, but also those elements of hA that fall within a given distance from 
the boundary. Define the following boundary regions with thickness p: 

Fi = Fi{h, p) = {a € AhA \ d(aj{ 0 ,hai)) < p} 

F2 = F2{h, p) = {ae Af,A I d(a, /(O, ha2)) < p} 

Fs = F^{h,p) {a 6 Aha \ d{a,l{hai,ha 2 )) < p} 



Let mi, m2, ms be three positive integers defined eis follows: 

mi = max{n e | d{na2,l{0, hai)) < p} 

m2 = max{n 6 | d{nai,l{ 0 , ha2)) < p} 

ms = max{n 6 Z~^ \ d{hai — na\, l{hai, ha2)) < p} 

For h sufficiently large, the distance p from the boundary is small in comparison to the 
convex hull of hA, therefore, Fi n F2 n F3 = 0 . The distribution of the elements of hA in the 
boundary region Fi can be described as follows: 

h—rrii 

F\ n hA — T k 2 d 2 I /u2 ~ O5 1 5 • • • 5 ^1} U 

ki^O 

h 

T /C 2 Q '2 I ^2 — 0 , 1 , . . . , /i — ^ 1 } 

ki=h-mi+l 
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h-m2 

F2 n fiA = 4- k 2 d 2 I = 0, 1, . . . , 1712] U 

/C2 = 0 

h 

IJ {k\ai H- k 2 d 2 \ k\ = 0, 1, . . . , /i — k 2 ] 
k2—h-m2 + l 
h—rn^ — l 

F3 n M = IJ {kiQi + k2d2 \ k2 = h - - ki, . . . ,h - ki) [J 

fci=0 

h 

IJ {kidi + k 2 d 2 I /c2 = 0, 1, . . . , /i — ki} 
k\=h—mz 

Note tlicit for i — I7 2, the set Pi HA consists of n disjoint union of h — uii 1 sets, where 
the consecutive sets in Fi P hA differ by ai and the consecutive sets in F2 P hA differ by a2- 
The union of the remaining sets in Fi P hA has a constant cardinality. Thus, for z = 1,2, 
cardinality of Fi P hA is 

\FiDhA\ = {h - rui l)(nii F 1 ) ^ h-k 

k=h—mi + l 

which is a linear function of h. 

Similarly, F^ P hA consists of a disjoint union of h — 7723 sets where the consecutive sets 
differ by ai - a2- The union of the remaining sets also has a constant cardinality. Thus, the 
cardinality of F3 P hA is 



h 

IF3PMI = (/i- 7723)7723+ h-kFl 

k=h~m3 

which is again a linear function of h. 

Furthermore, 

\hA P {Fi P Fj)| = (772i + l)(r?2^- + 1) 

The cardinality of hA in the boundary region can be computed: 



hA P Fi j + I hA P F2 1 + 1 hA P F3 1 

- \hA P (Fi P F2)| - \hA P (F2 P F3)| - \hA P (Fi P F3) 



i=l 2 



'^{rrii + 1 )( t 72 ^- + 1 ) 
i<j 



= Ch-D 



\hA P {Fi U F2 U F3)[ 
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Since mi, m 2 , m 3 are constants, then C and D are also constants where 



C 

D 



nil + ni2 + m3 



= E 



(m, - 2) {mi + 1) 



— 771 ^ + ^ mimj 
i<j 



Thus, the cardinality of hA in the boundary region is a linear function of h. 



3 The distribution of hA in the boundary region 



Let A be a finite set of lattice points in Z^. Let /i be a positive integer. 

Definition 3.1 An element of A is a vertex of A if it is the vertex of the polytope formed by 
the convex hull of A. Similarly, an element of A is an interior point of A if it is the interior 
point of the polytope formed by the convex hull of A. 

Let Va denote the set of vertices of A. 

Definition 3.2 Let a and b be two vertices of a polytope. We say that a and b are the 
adjacent vertices if the line segment connecting a and b is an edge of the polytope. 

Without loss of generality, we can assume that 0 G T. If there is an element a E A 
such that a is an interior point of A, then we can assume that 0 is an interior point of A by 
considering the set A — a. HA does not contain any element that is an interior point of A, 
then we can assume that 0 is a vertex of A by considering the set T — a where a is a vertex 
of A. 

Label the nonzero vertices of A in the following way, 

K4\{0} = {ajLi 

SO that consecutive vertices are the adjacent vertices. If 0 ^ Va, then ai is adjacent to ai. If 
0 G Va, then 0 is adjacent to both ai and a/. However, ai and o/ are not adjacent to each 
other. This particular case will be excluded from the proofs of Lemmas 3.1 and 3.2, and 
will be discussed separately in the proof of Theorem 4.2. Let A(0, Uj-i-i) be the convex 
hull formed by the three elements {0, a^,ai+i}. Partition the elements of A according to 
A(0, ai,ai+i) where a^, are adjacent vertices: 

Ai = y4 n A(0, tti, ai+i) for z = 1, ...,/ — 1 
Ai = ^nA(0, a;,ai) 
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Lemma 3.1 Letai, a^+i e 7?. Suppose z G A(0, a*, a^+OnZ^. Then there exist nonnegative 
rational numbers q' and q” such that 0 < q' q" < 1, and 

z = q'ai -4- q''aij^i 



Proof. 

Let ai = [ui,U 2 ), a^+i = and 2: = (^1,^2), where Ui,U 2 ,vi,V 2 ,zi, Z 2 are all integers. 

The solutions g' and q” to 

z = aiq' + ai-^-iq" 

is the solution to the system of equations 

Zi = Uiq' viq" 

Z2 = U2q' + V2q” 

The solution is rational. Moreover, since z G A(0, a^+i), 2: is contained in the set defined 

by 

{AiOj + A2fli+i I Ai, A2 G U {0}, 0 < Ai + A2 < 1} 
thus prove the lemma. □. 

By Lemma 3.1, for all a ^ Ai <Z A, there exist nonnegative rational numbers q'^ and q'^ 
such that ^ <q'a A q” < I and 

a = Qa^i T qa^i+l 
Let 

ma = min{m G Z+ U {0} | mq'^ G Z+ U {0}, and mq” G Z+ U {0}} 

Thus, there is a linear expression 

maa = m'^Oi T (1) 

where m'^ = maq'^ and m” = maq” are nonnegative integer coefficients and 

iria > ruaiq' + q”) > A m” 

Since A is finite, let 



Let 



m* = maxima} 

o^A 

k* — I I 

N = max II a || 



e = m*k*N 
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For z = 1, . . . , define a subset of such that 



Ki{h) = A(0, hai , . . . , /za^-i, /la^+i, . . . , hai) 

is the convex hull formed by the elements of F/i^U{0}\{/iaj}. Define a subset of A(0, /laj, /la^+i) 
that intersects the boundary region of /\hA as follows: 

^ hai-^i) I d{x, Ki U 7Ci+i) > e} for z = 1, — 1 

^i(h) = {x e A{0,hai,hai) \ d{x, KiU Ki) > e} 



The following lemma considers a special property of an element in the boundary region 
of hA. 

Lemma 3.2 Let A be a finite subset of 7? containing^. Let h be a positive integer. Suppose 
Oi and ttj+i are adjacent vertices. For i = I, . . . fi, ifwE Ai{h)nhA, then w — ai ^ {h — 1)A 
and w — a^-i-i E {h — 1)A. 



Proof. 

Define a subset A[ C Ai C A, where 



A[=: Ai\l{0,ai) 

which does not contain those elements of Ai that lie on the line segment 1(0, Qi). 
As an element of hA, w can be represented in terms of the elements of A: 

Ka + X] 

oeA'. 

a^A'i 



where EaeA ha = h. Let 



z;' = X haQ 

d G -A 

aiA!. 



Then w' E Kfih). Since w E Afih), by the definition of Afih), 

\w — w'\ = I X had\ > ^ = rn*k*N 



But 



This implies that 



X haN > X ha\a\ > I X had\ > e = m*k*N 

aeA'. oCA' 

X ha > m*k* 

aeA'. 
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Since k* > \A[\ for all z, there exists an a € such that ha > m* . Thus, by equation (1), 



haa = {ha - ma)a + + m"a,+i 

where rria > rn'a-h m” and and are both nonnegative integers. Furthermore, m'a 7^ 0, 
because a € A[. This proves that w — a^+i € {h — 1)A. 

Similarly, let 

A'- = Ai\l{0, aj-j-i) 

It can be proven that w — ai ^ {h — 1)A. □ 



Let p be a nonnegative real number. Define 



k{h,p) — {x e A(0, /in.j, /laj+i) | d{x,l(hai,hai^i)) = p) 



to be the line that intersects A(0, /za^-, parallel to /(/zaj, /zai+i) and is equal to p 
distance away from /(/zaj, /la^+i). Define the boundary region of A(0, /za^, /za^+i) as: 

^i{h, p) = {.T 6 A(0, hai, hai^i) | d{x, l{hai, hai +i))<p} 

which is the region that contains all those elements of A(0, /zoi, /zaj+i) that are less than p 
distance away from the boundary line /(/za^, /za^+i). 

For small h, Fi{h, p) D Ai(/z) = ^i{h). For h sufficiently large, Fi{h, p) Pi Aj(/z) ^ Ai(/z). 
In other words, there exists an element x € At(/z) such that x ^ Fi{h,p). 

Lemma 3.3 Let h be a positive integer. For h sufficiently large, there exist positive integers 
Mi and Ni independent of h such that for all real numbers r such that Mi < r < h — Ni, 

{rai + tai-^i |i€R, h — Si — r<t<h — r}c Ai(/z) 



where 

S^ = {6eK+ I {h-S)ai+i eli{h,p)} 



Proof. 

Define for all positive integers /z, 

Ai(/z) = min{A e R | Aa^ + (/z — A — Si)ai+i e Af(/z)} 

Let h' be a positive integer, and let u{h') = Xi{F)ai + {h' — \i{h') — Si)ai+i, then u{h') is an 
intersection of ffih' , p) with the boundary of Ai(/z'). Since the line li{h' , p) is parallel to the 
line l{h'ai, h'ai^i), and the boundary of Ai(h') is parallel to the boundary of Ki{h'), the four 
lines form a parallelogram. Furthermore, u{h') and /z'aj+i are the opposite vertices of the 
parallelogram. Let h" be a positive integer different from h' . Define u{h") = Ai(/z")aj + {h'' — 
\i{h") — Si)ai^i. Then u{h”) and h"ai^\ are the opposite vertices of the parallelogram formed 
by the lines li(h",p), l{h"ai, h"ai^i), and the boundary lines of Ai{h"), and Ki{h"). Since 
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h{h, p) is p distance away from l{hai^hai^i) and the boundary of Aj(/i) is e distance away 
from the boundary of Ki{h) for all positive integers h, the two parallelograms are congruent. 
Thus, the corresponding diagonals of the parallelograms, u{h') — and u(h”) — 

are equal. Hence, for all positive integers h' and h" ^ 

= u{h‘)-h'ai^i 

T {h!' — — h"ai^i = Xi{h')ai + [h' — Xi{hf) — 5i)ai^i — h'ai-^i 

-^l(^ ~ )(®t ®i+l) ^iO>i+l 



This implies 

Ai(/i') = Ai(/i") = Ai 

is a constant independent of h. Let Mi = [Ai] the least integer greater than Ai, then Mi is 
a constant independent of h. 

Similarly, for all positive integers h, define 

X 2 {h) = min{A 6 R | (/i — A)a^ + (A — Si)ai^i e A^(/i)} 

It can be shown again that for two different positive integers /i' and /i", the parallelogram with 
opposite vertices (h' — X 2 (h'))ai + {X 2 {h') —5i)aiJ^\ and h'ai is congruent to the parallelogram 
with opposite vertices {h” — X 2 {h"))ai + [X 2 {h'') — and h"ai. Thus, 

{h" — X2{h”))ai + {X2{h") — 6i)ai-^-i — h" ai = {h! — X2{h'))ai + {X2{h') — — h ai 

X2{h"){aijf-i — ai) — Siai+i = X2{h'){ai^i — ai) — Siai^i 

This implies 

A2(/i') = A2(/i") = A 2 

is a constant independent of h. Let Ni = [A 2 J the greatest integer less than A 2 . So Ni is a 
constant independent of h. 

Moreover, for all real numbers r such that Mi < r < h — Ni, 

{rai -f I i G R, h — Si — r<t<h — r}c Ai{h) 



□ 



Let hi^n[h) G l[hai, hai^i) be determined by n and h as follows: 

^i,ni,h) — Tltti {Jl> 

For each nonnegative integer n, such that 0 < n < /i — 1, define a subset of R^: 

B^^n{h) = {bi^n{h) - ra^+i + ta 2 {ai - a^+i) | 0 < r < (5i, 0 < t < 1} (2) 



For Mi < n < h — Ni — I, 



Bi^nih) Cl Fiih, p) lXi{h) 
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The following lemma shows that for all nonnegative integers, m and n such that 0 < 
m.n < /i — 2, the elements of Bi^rn{h) are the elements of translated by special 

elements of hA. This will play a role later in the article when we consider the boundary 
elements of hA. 

Lemma 3.4 Let h be a positive integer whose value is fixed. For all nonnegative integers n 
such that 0 < n < h — 2, Bi^n+i{h) = Bi^n(h) + ai — a^+i. 

Proof. 

Let X = bi^n{h) — + t{ai — a^+i) € Bi^n{h) for some r and t such that 0 < r < and 

0 < t < 1. Then 



X — bi^ni^h^ vai.^1 T ti^ai 

= nai + {h- n)a^+i - roi+^ + t(ai+i - a.j) 

.T -f aj - Oi+i = {n -f l)aj + {h - (n + l))ai+i - ra^+i + t(ai+i - Oi) 

= bi,n+i(h) - rai+i + t{ai+i - a^) 

By definition, x a^ - € Bi^ri+i{h). Thus, Bi^n{h) 4- - ai^i C Bi^n+i{h). 

Conversely, if x € then 

^ (^) ^^i+1 T 

= (n + l)aj + (h - (n + l))a^+i - ra^+i + t{ai - 
= noi + {h - n)ai+i - ra^+i + t{ai - ai+i) + 

— bi^ri{h^ ~h i{oji ~H ai 

Let y = bi^n{h) - ra^+i 4- t{ai - a^+i) € so x = y + ai - a^+i G - a^+i. 

Thus, Bi^n+i{h) C Bi^ri{h) -f a* - a^+i. Therefore - fli+i. □ 

The following lemma shows that for h sufficiently large, the set Bi^n{h-\rl) is a translation 

of the set Bi^n{h). 

Lemma 3.5 For all nonnegative integers n such that 0 < n < h — 1, Bi^ri{hF 1) = Bi^n{h) 4 - 



a^+i) 
- di+i 



Proof. 

Again, let x G Bi^ri{h), then for some real numbers r and t such that 0<r<c^^,0<t<l, 
we have 



X = bi^n{h) - ra^+i + t{ai - a^+i) 

= noi 4- (h — n)ai^i — ra^+i 4- t{ai — fli+i) 

X 4- a^+i = noi -f ({h + 1) - n)ai+i - rai+i -f t(a^ - Oj+i) 
— bi^riifi 4* 1) 4" t{ai 
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Thus, a: + a^+i e + 1) implying + a^+i C + 1). 

Conversely, if rc 6 -hi), then 

^ = bi,n(h + 1) - ruj+i -h t(ai - a^+i) 

= nai -h {{h + 1) - n)ai+i - rai+i + t{ai - a^+i) 

= nai T {h - n)ai+i - ra^+i + t{ai - a^+i) -h a^+i 
= bi,n{h) — raj-i-i -f t{ai — ai^i) -f ai+i 

There exists an element y = hi^n{h)-rai+i+t{ai-ai+i) € such that x = t/T^z+i, thus, 

X € -ha, -+i implying -hi) c Bi,„(/i)-hai+i. Hence, B^,„(/i-h 1) = + a,+i. 



Define 



Mi-1 

B,,p{h) = (J S,,„(/i) 

n=0 

/ h-1 \ 


(3) 


BiAh) = U BiAh)]nFi{h,p) 

\n=h-Ni J 


(4) 



The set Bi^F{h) represents the area of the first cell which is bounded by the lines l[hai, hai^i), 
h{h^p), /(O, hai_^i), and k^Uiih)- On the other hand, Bi^L(h) represents the area of the last 
cell which is bounded by the lines l{hai, hai+i), li{h,p), li,Ni{h), and /(O, /la,). 

The next two lemmas consider the first and the last cell separately, but prove the same 
result: the h + 1 level cell is the translation of the h level cell, hence the cardinality of the 
cell is independent of h. 

Lemma 3.6 Bi^F(h + 1) = A,F(h) + a^+i. 

Proof. 

^.From equation (3), 

Mi-1 

+ a,-|-i = [J Bi^n(h) + QiFi 

n—O 

Mi-1 

~ U +1) 

n=0 

= Bi^F{h -h 1) 



Thus proves the lemma. □ 
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Lemma 3.7 Bi^L{h + 1) = + a^+i 

Proof. 

Suppose X E BijXh), then x E Fi{h, p) and x E Bi^n{h) for some nonnegative integer n such 
that h — Ni < n < h — 1. By Lemma 3.5, it has been shown that x + E + 1). 

To complete the proof, we need to show that x + ai^i E i^ (/i -I- 1, p). Let w he a, point 
on the line l{hai^ hai+i), such that \w — x\ = D < p. Since w + a^+i is a point on the line 
^((/i + l)ai, (/i + l)ai+i), then 



d{x + 0 ^ 4 - 1 , /((/z + l)ai, (h + l)ai^i)) < d{w -f a^+i, a: + ai+i)| = \w - x\ = D < p 

Thus, X + E Fi{{h + 1), p), therefore x + ai+i E Bi^L{h + 1) implying Bi^iih) + ai+i C 
Bij,(]^’ T f )• 

Conversel}^, let x E Bi^L{h + 1), then x E Fj(/r -{- 1, p) and x E Bi^n{h + 1). There exists 
an element y E Bi^nW such that x = y + ai^i. Using the similar argument as before, it can 
be shown that y E Fi{h, p) and y E Bi^L(h), thus implying x E Bi^L{h) + a^+i. Hence, prove 
the lemma. □ 

Let Si^n(h) denote the subset of hA that belongs in Bi^n[h). Let Ui{h) and Vi{h) denote 
the subset of hA that belong in the first and the last cell respectively, thus, we have the 
following definitions: 



Si,n(h) 


= hAn Bi^n{h) for Mi < n < h — Ni — 1 


(5) 


Ui{h) 


— hA n Bi p{H) 


(6) 


Vi{h) 


— hAp[Bi^L{h) 


(7) 



The next two lemmas consider the elements in the boundary region of hA and prove that 
the cardinalities of hA in the cells Bi^n{h) are equal and independent of h. 

Lemma 3.8 For all nonnegative integers n such that Mi < n < h — N{ — 2, 

5*1,71+1 (fi) “ T cij O'i+i and 

\Si,n+l{h)\ = |5,,„(/l)|. 



Proof. 

Suppose X E Si^n(h)^ then x E hA and x E Bi^n{h). Lemma 3.4 implies that x -E ai — E 
Bi^n-\-\{h). There left to prove that x + E hA. Since Bi^n(h) C (Fj(/i, p) n Aj(/i)) 

for all nonnegative integers n such that < n < — 7V^ — 1, so x E Aj(fi). Moreover, 

X E A:^iih) n hA. By Lemma 3.2, both x — and x — aj-j-i are in {h — 1)A. Thus, 



X -E Ui — aj_|-i = (x — tti+i) + Ui E {h — 1)A F A = hA 
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Therefore, x + and thus Si^n{h) + - a^+i C Si^j,+i{h) 

Conversely, if .r e Si^ri+i{h), by definition, x E hA and x E Lemma 3.4 shows 

that there exists an element y E such that x = y -{- ai - Oi^i. Let y = x - ai + ai+i. 

Again, x E ^i{h) since Bi^n^i{h) C Fi(h, p) fl Ai(h), so by Lemma 3.2, both x — ai and 
X — are in {h — 1)A. Therefore, 

y = X - tti F tti^i = {x — ai) + tti+i E {h - 1)A A = hA 

implying?/ € which implies x € + ai - a^+i. Hence, 5i,„+i(/i) C Ta* — 

tti+i. Hence, Si^n+i{h) = Si^n{h) + o.i — ai+i. This also proves that = |*5'i,n(h)|. □ 



Lemma 3.9 For all nonnegative integers n such that Mi < n < h — Ni — 1 , 
5'i,n(fi + l) Si^n{h) A ai-^i and 

|5,,„(h + l)| = |5,.n(/l)|. 

Proof. 

Result follows directly from Lemma 3.5. □ 



Lemma 3.10 



Ui[h T 1) — Ui{li) -j- Ui-fi and ^i{h + 1) — \^i{li^ + 



Furthermore, 



\Ui{hAl)\ = \Ui{h)\ and |Fi(h+l)| = \Vi{h)\. 



Proof. 

Refer to equations (6) and (7) for the definitions of Ui{h) and Vi(h). The result follows 
directly from Lemmas 3.6 and 3.7. □ 

The interesting finding of this research is that for h sufficiently large and for all Mi < n < 
h — Ni — 1, the cardinality of hA in each set Si^n{h) is a constant independent of h and n. The 
cardinalities of Ui{h) and Vi{h), the first and the last cell of the partition, are also constants 
independent of h. Thus the cardinality of hA in the boundary region of A(0, hOi, hoi+i) is 

• ( number of cells ) A \Ui{h)\ A \Viih)\ 

The only component of the above expression that depends on h is the number of cells which 
is determined by Mi and 7V^. 
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4 Main theorems on the distribution and the cardinal- 
ity of the boundary elements of hA in 

The following theorem proves the special case by considering the pattern of hA in the bound- 
ar}^ region of A(0, /lai+i ) where and 0^4.1 are two adjacent vertices of A. 

Theorem 4.1 Let A be a finite subset of 7? containing 0. Suppose and are two 
adjacent vertices of A. Let h be a positive integer and p a nonnegative real number. Define 

Fi(h, p) = {x e A(0, hoi, hai+i) | d{x, l{hai, hai+i)) < p) (8) 

Then for h siifficiently large 

\Ffih,p)nhA 1 = Q^h- Di 

where Ci and Di are some constants. 

Proof. 

Let Mi and Ni be two constants independent of h, determined by the intersection of the 
line li{h, p) with the boundary of Aj(/i) as in Lemma 3.3. Define Bi^n(h), Ui{h), and 

Kj(/i) the same as in equations (2), (5), (6), and (7). 

For h sufficiently large, 

h-Ni-\ 

Ffih,p)r\hA = U Si^n{h) IJ Ui{h) U Vfih) 

i=Mi 

where the intersection of any two sets in the union is empty. Thus the cardinality is 
\Fi(h,p)nhA\ = '‘yI + \Uiih)\ + \Vi{h)\ 

i=Mi 

Let Ci = Then Q is a constant independent of n and h by Lemmas 3.8 and 3.9. 

Furthermore, the cardinality of Ufih) and Vfih) are constants independent of n and h by 
Lemma 3.10. 

Therefore, the number of elements of hA that are within p distance away from the face 
l{hai., /lOj+i) is: 

\r(h,p)nhA\ = + \Ui(h)\ + \v\(h)\ 

i=Mi 

= {h-N,- Mi)Ci + I U,[h) \ + I Vi(h) I 

= Ci-h-Di 

where A = (Mi + iV<) • a - \U,(h)\ - |Ki(/i)|. □ 

By using the result of Theorem 4.1, Theorem 4.2 proves the general case. 
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Theorem 4.2 Let A be a finite set of lattice points in T? . Let denote the boundary 

of AhA- Let h be a positive integer. Define 

p) = {x G Aha I d[x, OAha) < p] 

to be the region that is less than p distance away from the boundary. Then for h sufficiently 
large 

I hAr\FA{h,p) \ = C-h + D 
where C and D are some constants independent of h. 

Proof. 

Case 1; A contains an element that is in the interior of the convex hull of A. 

Without loss of generality, we can assume that 0 G ^ and that 0 is in the interior of the 
convex hull of A. Since we can consider the set A — where a G A is an interior element of 
the convex hull of A. 

Let Va be the set of vertices of A. Let 



Va = {flilLi 

where I — \Va\. Assume the consecutive vertices are the adjacent vertices, and ai and ai are 
the adjacent vertices. Define 

Fi{h, p) — {x 6 A(0, hOi, /ia^+i)\/(0, hai^i) \ d(x, l(hai, haj+i)) < p} for z = 1, ...,/ — 1 
Fi(h, p) = {.T G A(0, hai, /zai)\/(0, hai) | d[x, l(hai, hai)) < p} 

apply Theorem 4.1 to each set A(0, hai, hai+i) and A(0, hai, hai), thus, for alH = 1, . . . , /, 

\hAnFi{h,p)\ = Cih-Di 



where Ci and Di are constants. 

LoXTfih)=Ufih)n\\+i{h). 

Ti{h) = {n G Z+ I d{nai^i, hai^i) < p] 

Since the cardinality of Ufih) and Vfih) are independent of h, the cardinality of Ti{h) is also 
independent of h. Therefore, 

\hAnFA{h,p)\ = j^\hAr\Fi[h,p)\-Y.\Ti{h)\ 

= j^Cih-Di 
2 = 1 

= Ch-D 
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where 

C = J2C, and D = ^A + El^i(Mi 

i=l r:=l 

Case 2 ; A does not contain an element that is in the interior of the convex hull of A. 

Without loss of generality we can assume 0 G ^ and 0 is a vertex of the convex hull of 
A. Since we can consider the set ^ - a where a e A is 3, vertex of the convex hull of A. 

Let Va he the set of vertices of A. Let 

where I = |Lyi\{ 0 }|. Assume the consecutive vertices are the adjacent vertices, and 0 is 
adjacent to both ai and ai. Define 



Fi{h, p) = [x e A( 0 , hai, /m,+i)\/( 0 , ha.i^i) \ d{x, l{hai, /la^+i)) < p) for z = 1, 1 
apply Theorem 4.1 to each set A(0, /za^, /zoi+i), thus for alH = 1 , . . . , / - 1 , 

\hAnFi{h,p)\ = Cih-Di 



where Q and Di are constants. 

Define the remaining two boundary regions of A/^^ in the following way: 

Fo(/i, p) = {x G A(0, /lai, /za 2 ) I d(x, /(O, /lai)) < p} 

Fi{h,p) = {x e A{h,hai-i,hai) \ d{x,l{h,hai)) < p} 

To be able to apply Theorem 4.1 to the case of \hA D Fq(/z, p)| and \hA n Fi{h, p)\, we will 
translate the set A by a. 2 - Consider the set A' = A- a 2 and let a(j = 02 -a 2 = 0 ; a[ = ai — 02 ; 
a '2 = qq - a. 2 ; and a '3 = ai- a 2 - Applying Theorem 4.1 to {0, ai, a^} and {0, a^, 03 }, we have 

\hAnFo{h,p )\ - \hA' n Fo{h, p)\ 

and 

\hAnFi{h,p)\ = \hA'nFi{h,p)\ 

Putting it all together we have for z == 0, and /, 

\hA n F,{h, p)| = \hA' n F,(/z, p)| = Q/z - A 
Let 

Ti{h) = HAD F,{h, p) fl Fi+i(/z, p) = Ui{h) H Vi(/z) for z = 0, 1, ...,/- 1 
Ti[h) = MnFz(/z,p)nFoA,p) = A(/i) + Co(/z) 

By Lemma 3.10, the elements of Ui(h + 1) and Vi(/z -f 1) are simply the translation of the 
elements of Ui[h) and Vi{h) by ai^i. Thus, the elements of Tj(/z-f 1) are also the translation 
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of the elements of T^(/i) by Since the cardinality of Ui{h) and Vi{h) are independent of 
/i, so is the cardinality of Ti{h). The cardinality of hA in the boundary region is: 

I hA n F^h, p) I = i^lhAn Fi{h, p) I - E |T.(ft)| 

= j2Ci-h-Di-m(h)\ 

z=0 

= C-h-D 

where C = E'=o Ci and D = E'=o A + \Ti(h)\. □ 

Define 

^hA(p) = 6 A,,a I d{x,dAi,A) > p} 

to be the set of elements of that is greater than or equal to p distance away from the 
boundary of /^ha- 

Theorem 4.3 (Khovanskii) Let A be a finite subset o/Z” such that A generates . Then 
there exists a constant p with the following property: for an arbitrary natural number h, every 
lattice point of the polytope A^Aip) belongs to hA. 



Theorem 4.4 Let A be a finite subset of 7 ? such that A generates Z^. For all sufficiently 
large positive integers h, there exists a nonnegative real number p independent of h such that 

hA = (Ahx(p)nZ^) U {FA{h,p)nhA) 

where p) H hA\ is a linear function of h. 

Proof. 

Let p be the real number in Theorem 4.3. Thus /LhA^p) H Z^ C hA. For all h sufficiently 
large, 

hA = (A/,a(p) n hA) U {FA{h, p) n hA) 

= (A,.A(p)nz^)u(F^(/i,p)nM) 

By Theorem 4.2, |F^(/i, p)(^hA\ is a linear function of h. □ 

In conclusion, Theorem 4.4 has presented a general structure for hA where A is a finite 
subset of Z^. The general structure is this: There exists a nonnegative real number p such 
that all lattice points whose distance from the boundary of /Sha is greater or equal to p 
belongs in the ’’core” structure of hA, and those lattice points whose distance from the 
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boundary is less than p belongs in the boundarj^ region. The distribution of these boundary 
elements of hA exhibited a regular pattern since they are related by a translation of an 
element. Khovanskii had shown that the cardinality of those elements of hA in the ’’core” 
structure is a function of h” where n is the dimension of the space, thus in the case of Z^, 
the cardinality is h?. The author had shown explicitly by using simple counting principles 
that the cardinality of the boundary elements of hA is which in the case of 7 ? is a 

linear function in h. 



References 

[1] G. Ewald, Combinatorial Convexity and Algebraic Geometry, volume 168 of Graduate 
Texts in Mathematics. Springer-Verlag, New York, 1996. 

[2] S. Han, C. Kirfel, and M. Nathanson, ’’Linear forms in finite sets of integers,” The 
Ramanujan Journal. 2(1998), pp. 271-281. 

[3] A.G. Khovanskii, ’’Newton polyhedron, Hilbert polynomial, and sums of finite sets,” 
Funksional. Anal. Prilozhen. 26 (1992), pp. 276-281. 

[4] M.B. Nathanson, Additive Number Theory: Inverse Problems and the Geometry of Sum- 
sets, volume 165 of Graduate Texts in Mathematics. Springer-Verlag, New York, 1996. 




ON NTU'S IN FUNCTION FIELDS 



Howard Kleiman 

1. Introduction. Let g(x,y) be a polynomial of degree n in x with 
coefficients in z[y] which is absolutely irreducible. Suppose 
that the coefficient of x" is c' while g(0,y) = c with c, c' e z. 
Let b be a root of g(x,y) . Then b is an NTU (non-trivial unit) of 
F = Q(y) (b)/Q(y) . We can generalize Hilbert Class Field Theory 
using NTU units. Also, by elementary algebra, there are but a 
finite number of solutions of q{x,y) = 0 in Q X z. We can 
explicitly obtain them. Furthermore, using a computer algorithm, 
we can explicitly obtain solutions in Q X z of a large number of 
minimal equations of F. 



2. theore:m. Let f(x,y) = 0 be a polynomial of degree n with 
coefficients in z[y] defining the field L = Q (y) (a) /Q (y) . Assume 
that f(x,y) is absolutely irreducible. Suppose that 

(1) f(0,y) factors into cq(y) in z[y] where Ic l> 1, 

c E z, c is not a perfect n-th power in z, and q(y) is monic and 
irreducible; 

(2) there exist rational integers y' and y' ' such that both 
f(0,y') amd f(0,y'') are irreducible over Q with indices 
relatively prime to c; 

(3) f(0,y') and f(0,y'') have c as their gcd; 
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(4) t{Q,y*)/c (y* = y‘ , y") is relatively prime to c. 

(5) If a' = a(y') and a'' = a(y''), the class fields of 
L* = Q(a') and ii''= Q(a'') have Q as their intersection. 

Then h contains an NTU. 



Proof. Either a is divisble by an NTU, u, in L whose absolute 
norm is c, or else there exists an algebraic integer A, unique to 
within an algebraic unit, which divides a = a(y) for every 
specialization, y* , of y into a rational integer. In the latter 
case, if conditions (1) - (4) hold for y = y* , the ideal 
(c, a*) = (A)* is a factor of (cq(y*)), y* e z, . Therefore, (A)* 
is contained in the class field of L* = Q(y*)(a*) (y* = y* / y''). 
implying that their respective class fields both contain Q(A) as 
a subfield, contradicting (5) . Therefore, a is divisible by an 
NTU u (not necessarily in h) . But, from (1) , q(y) is an 
irreducible element of Q[y] . Therefore, (a, q(y) ) is a prime 
ideal of degree 1, that is, a principal ideal in . Therefore, 
(S/ q(y)) = (a), (a) is representable by the ideal number ua 

where u is a unit in L. u is either an element of Q or an NTU. 

But its absolute norm is ± c which isn't a perfect n-th power. It 
follows that u is an NTU. 




Continued Fractions and Quadratic Irrationals 



Joseph Lewittes* 

Department of Mathematics and Computer Science 
Lehman College - City University of New York 
Bronx, NY 10468 

lewit tes0alpha . lehman . cuny . edu 



Most books on number theory contain a chapter or two on continued fractions, 
which really are indispensable in a number of areas. Nevertheless, they have not 
achieved a mainstream popularity and are often omitted in courses on number theory. Of 
course there are reasons for this; their basic construction strikes one as rather bizarre and 
they are notoriously impossible to manipulate with respect to the usual operations of 
arithmetic. Furthermore, they have no satisfactory generalization to fit into a more 
comprehensive framework. But, they are surprising and interesting. One of the key roles 
played by continued fractions is in the construction of units in real quadratic fields. In 
studying this topic we have found that the entire theory of units in such fields can in fact 
be derived via continued fractions. Also, the periods of the continued fractions associated 
with quadratic irrationals exhibit a certain pattern or structure when classified by 
discriminant, which has not been previously noted. 

In presenting these results I am also attempting an introductory exposition of 
continued fractions. This paper is meant to be self-contained in the sense that all basic 
terms will be explained; specific facts quoted from the literature can simply be accepted, 
for the moment, without impeding the reader’s progress. Our focus is narrow, 
concentrating on the continued fractions themselves, omitting any discussion of their 
intimate connections with ideal theory and binary quadratic forms. The following 
theorem may be considered an ultimate goal; eventually its statement will become 
meaningful, numerical examples will be given and perhaps the reader will be intrigued. 



’Partially Supported by a PSC-CUNY Research Grant. 
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Theorem Let be a discriminant and e = , the fundamental unit. Let x 

be a reduced quadratic irrational with discriminant d, having the purely periodic 
continued fraction . Let e be the number of terms b, that are even. Then 

the parity of e depends on d only, as follows: 

If d is odd, then e is even. If d is even, then e=u (mod 2). 

We formally designate as theorems those statements that contain new results. A 
standard reference on continued fractions in Perron’s book [5]; this is the third edition. 
The classical results that we cite are all found there in the first three chapters. A recent 
book in English is [6]. For more on orders in quadratic fields and binary quadratic forms 
see [2], chapter 2 and [1], a more leisurely approach. But these do not integrate continued 
fractions into their discussion. While [4] does - see, in particular, the preface - it treats 
only the case of field discriminants. 



1. Finite Continued Fractions 

Continued fractions may be motivated by the Euclidean Algorithm, which we 
recall with a simple example. To find the greatest common divisor of 26 and 7 (denoted 
gcd(26,7) or just (26,7)) one performs successive integer divisions yielding quotient and 
remainder to obtain: 



26 = 3x7+5 
7 = lx 5 + 2 
5=2x2+l 
2 = 2x 1 

The last non-zero remainder, 1, is (26,7); the quotients generated, 3, 1, 2, 2, underlined 
above, seem to be irrelevant. But, waste not want not! Dividing both sides of each 
equation, save the last, by the divisor, they become: 



26 1 7 

By successive substitutions one then obtains 



\_ 

1 

2 





1 + 



1 



2 



^2 



(1) 
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and — is now presented as a complicated compound fraction, but using only the 

underlined quotients, and a 1 for each numerator. Such an expression is called a (finite) 
continued fraction. In general, if are any numbers then 



bo + i (2) 

^1 + 

1 

^2 + • . 1 
K-2 + 7 

is called a continued fraction. Note that n> 1 and if «=1 we have only - but it still 
qualifies technically as a continued fraction. We refer to ,...,Z>„_, as the terms of the 
continued fraction (the n terms are indexed from 0 to /i-l) though in the literature they are 
called the ‘partial quotients.’ (2) is too bulky so we write it in condensed form 

[/?(), Z?, As long as ,...,Z?„_, are positive real numbers, which we assume from 

now on, though may be < 0, (2) has a well-defined numerical value, again designated 

by the symbol ,...,Z7„_j ] . Thus (1) shows — = [3, 1,2,2]. Since 2 = l + j, 



y = [3,l,2,u] also. 



26 



What worked for — shows that any rational numbei x can be expressed as a 



continued fraction. Write x = , c, d integers, d>0, not necessarily relatively prime, and 

d 

apply the successive divisions of the Euclidean Algorithm starting with division of c by 
d. If lines are produced with the quotients being ,...,Z?„_, , then a- = [Z?o,Z?, ,...,Z?„_, ]. 
Note that if a is not an integer then n>\ and the last term b„_^ > 1 so that also 
^ = [*0,6 -U]. If A is an integer then n=\, the one line of the algorithm reads 

c = b^d , with Z?() = A , so a = [z?„] = [z>q - 1,i] . In particular, every rational a has two 
continued fraction representations, one having an even number of terms and the other an 
odd number. This observation plays an important role later on. 
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The reader will have noticed that we’ve been tacitly assuming that the terms 
are integers. This indeed is the case, but later on we will want to use the option of having 
the last term not an integer. So this should be kept in mind, but from now on, unless 
otherwise noted, the terms of the continued fraction will be integers only. 

Actually, the two ways shown to represent x as a continued fraction are the only 

ways. This can be seen as follows. First note that if -f \ — is any continued 

(b^-^- *.) 

fraction with three or more terms then its value is not an integer, since (Z?, -f‘ • .) >1 ; also 

~ is not an integer if Z?, > 1 . Thus the only possible continued fractions for an 
bx 

integer x are those two given above. Suppose now x not an integer and [c^ , c, , . . . , c„_, ] is 

any continued fraction whose value is x. Then x = Cq -f — , y = [c, ,... ,c„_, ] and y > 1 

since x not an integer. Thus Cq = [x J . Here, for any real number t , J denotes the 
greatest integer < r , the unique integer k, such that k<t <k + \. We use this notation 
rather than the customary [/] to minimize confusion with the square brackets used for 
continued fractions. If also x = we have = L^J = , hence 

,...,a^_, ] = [c, ,...,c„_,] . If this common value is an integer we have only the two 
possibilities described above, while if not, then a, = c, and continue on t;ie same way. 

So far we’ve discussed how a rational number can be represented by a continued 

fraction. We now turn around and ask, given ,...,Z?„_, ] how should it be evaluated? 




The natural inclination is to do it from 



the bottom up, 3 + ^ = y , then and finally 5 + ^ . This is fine, but 

eventually continued fractions will become ‘infinitely long’ by allowing the number of 
terms to increase, and from this perspective it is better to evaluate from the top down. So 
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429 

[5, 4, 3, 6] = — — . But how are all these related to each other and the final result? To 

o2 

answer this, consider, for the moment as variables. Then [^oJ 

[ba,b^] = b^+ — = ^ * . If we set A,,=b^, = 1 , A, = b,bg + 1 , B, = 6, then 






which suggests defining A^^b^A^ + A^, B^=b^B^+B^ so that = ^. This 

Bi 

scheme may then be continued. Formally we define, given the sequence ,^ 2 »•••» 

^.,=1, i?_,=0, B, = \ (3) 

and inductively for i > 1 

= b,A._, + A .,, , B^ = b^B,_, + B,_2 . (4) 

The /t_| , B_^ are a useful convenience. They, along with Bq , are constants while for 
/>0, 4, B. are polynomials in Z?o,Z?, but 5, is independent of . A suitable 
induction argument then shows that for all n>] 



[boA^-A-,hj^- ( 5 ) 

^w-I 

This is a formal algebraic identity which then remains true if one substitutes for bQ any 
number and for the , / > 0 , any positive numbers ; so 5„_, ^ 0 . Here number may 
mean integer or real but we are interested only in integers. Returning to the original 

question as to how to evaluate the continued fraction [bo,b^,...,b,,_^] the answer is to 



A 

calculate successively, and then the value is — This is done best 

A-i 

with a table which we present for our example [5, 4, 3, 6], 



/ 


-1 


0 


1 


2 


3 


b 




5 


4 


3 


6 


A 


1 


5 


21 


68 


429 


B 


0 


1 


4 


13 


82 



( 6 ) 
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First the two rows labeled / and b are filled in; / acts as column index. Then the rows 
labeled A and B are filled in left to right, according (3), (4). Then for any / > 0 , the value 

of ] is read off from the table as ^ . 



2. Matrices and the Group G 

( A A 

The definition (3), (4) can be written ' 



4-1 A-iYb, 1 



(A- A- ^(b 1 

If /■ > 2 , then the first matrix on the right is similarly 

V ^/-2 ^,-3 



e,., «„Ai 

and iterating 



this one obtains finally: 



A 4 - 

B^_ 



bo lYft, n fb. 1 



1 oA 1 0/- 



We introduce the abbreviation 



4,-, 1 



Thus we have the following continued fraction - matrix connection: 

is. '. ‘-1- 

(b n 



Since the matrix has determinant -1 we immediately obtain 

4,-,5„-2-4,-2A-, =(-1)". 

Here is a nice immediate application of this matrix viewpoint. Since 



(7) 



( 8 ) 



(9) 



( 10 ) 



1 0 ) 

= so the 

value of the continued fraction obtained by taking the tenns in reverse order is . 



(a b 

symmetric, transposing both sides of (9) yields 

V ^„-2 A -2 



429 

Thus, noting (6), [6,3, 4, 5] = . 

68 
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Basic to all that follows is the set G of all 2x2 matrices ^ ^ith a, b, c, 

integers and det M = ad -be = ±\, which ensures that e G and G is a group with 
respect to matrix multiplication. Note that any two entries in the same row or column of 
A/ must be relatively prime and if a 0 occurs the other entries in its row and column must 
be ±1. 



A notation that will be useful is to define, given a rational x, CF^[x] is the 
continued fraction for jc with an even number of terms and CF [x] is that with an odd 



number. For example, from (6), CF^ 



429 



82 



r 4291 

= [5,4,3,6]and CF — = [5,4, 3,5, l]. If 



CT = ±1 we write CF^ for CF^ or CF_ according as cr= 1 or -1. The following result is 
a lemma that is implicit in the literature but not in the form we need it. 

fa b^ 

Lemma 



Let M = \ , \ eG , c> d > 0 . Let cr = det M , 

KC dj 



CF„ 



= • Then A/ = and this representation - as an M 



matrix - is unique. 

Proof Let A = We have to show A/=iV. By 

construction, (- l)" = det N and (-1)" = cr = det M . Writing, as usual. 



N : 



B„_, 



we have = ~ • Both fractions are reduced and 



c> 0 , by hypothesis, and > 0 follows from (3), (4). Thus = a , = c . Note 



that n>2 since c> d>Q makes c> 1 , — not an integer, det M = det N now implies 

c 



ad -be = aB ^,_2 - eA ^,_2 or a(d - F „_2 ) = ~ A 1-2 ) • So c divides a(^d - B „_2 ) , (c, a) = 1 , 

hence e divides d - B„_2 • Now \ <d<e and also 1 < B „_2 < F„_, = c , so \d - J 5 „_ 2 | < e , 
and being divisible by e this forces d = B „_2 and then b = A „_2 , hence M = N , For the 
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uniqueness, if also M then we would have — = , where 

are the quantities associated with Co,c, . But this implies 

- = [Z7o,^,,...,6„_,] = [co,c,,...,c^_,] and (- 1)" = det M = (- 1)* so k = n{modl). But - 
c c 

has only two continued fractions, one with an even number of terms and the other with an 

odd number, so k = n and c, = bj for all i. 

Now we are interested in the set of all matrices in G that can be expressed as 

(b 



a product of matrices ^ where ^ is a positive integer. Thus 

G" ={MgG I M = ],«> 1 , all positive integers}. (11) 

O'" is a subset -not subgroup - of G and is clearly closed under multiplication: M, 

0 ^ 



N gG* implies MN gG^ . The identity matrix . I Our first theorem 

gives another characterization of G^ . 

fa 



Theorem 



M = \ , I e G"" if and only if 

dj 



a>b>d, a>c>d, d >0. 



( 12 ) 



If A/ G GMet o- = det M , CF 



= . Then M = M 



the unique representation of M as a product of matrices of type [ ^ oj ’ ^ ^ positive 



g 1 



integer. 



Proof 



Let, for the moment, G' be the set of matrices M satisfying the 



a. b, 



inequalities (12). Given M: = ' ' /=1, 2, in G' set 



Ic dJ ' ' U, dj[c, dJ (c,a,+d,c, c,b,+d,dj- 

C> D is equivalent to c, (oj - ) + c/, (cj - c/j ) > 0 . Since c, > c/, > 0 and > 
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c,>d, it follows that C> D . In the same way one shows the other inequalities are 



(A B] 

satisfied to assure 1 ^ I eG\ Thus M, eG' and G' is closed under 

fg n 

multiplication. Since 1 I , g > 1 , is in G' it follows, recalling the definition of G^ , 

that G^ czG' . We now show the reverse inclusion along with the uniqueness. Assume 
(a b) a 

M = ^ \ gG' . U od >0 then a> c, — > 1 , not an integer, so M has the unique 



representation stated, via the lemma, and = 



> 1 , so M eG"^ . If c> d >0 fails, 



then, since M eG \ either l=c>^/ = 0orl = c = ^f>0. In the former case we must 



.a 1 

have b - \ , M - \ ^ ^ | with ^7 > 1 . Then det A/ = - 1 , CF 



a 

lTj 



= [(7] so 
r 1 

M ~ M[aJ eG^ and the uniqueness is clear. \f c-d = \ , = I j ij » 

det M - a-b = ±\ . But a>b then forces a = ^ + 1 , det M = \ . Thus 



a 


= CF^ 


~b + f 


c _ 




1 



i]=(i III = 

M eG* and the uniqueness is clear. This completes the proof of the theorem. 
22 135W 7 43^ f43 135' 



= [^,l] . Thus again 



An example: [ ^ 43 J ’ 1^22 BsJ ’ I 7 22 ' ^ ’ while 



ri35 22' 

^ ^ G G^ . Since det M = -1 we calculate CF 

V43 7 



135 



M 



= X X o] - On ‘he other hand iV = 

= [3,7,5,l] and so = 



43 

135 113' 



= [ 3 , 7 , 6 ] and so 



calculate CF 



135 

43 



43 36 

3 1V7 IV 5 iVl 1 



eG* , det = 1 , we 



1 oAi oAi oAi or 
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3. Infinite Continued Fractions 

We’ve seen that the process for obtaining the continued fraction of a rational 
number can be expressed in terms of the greatest integer function. As such, it can also be 

applied to an irrational number jc. Set Xq=x, = L^o J • Then < Xq < + 1 so setting 

> 1 we have jCq = + — . Again set b^ = [a:, J , x^ = — > 1 and 



Xo~bo 



continuing this way we generate a sequence jCq , jc, ,^2 ... of real numbers and integers 
^0 ,^2 ’••• ^ ^ ’ ^/-i “ ^/-i ’ ^/ > 1 and 



:i'o = + - 



b\ + ■ 



[^0 ’> b ] 5 ^,] ’ 



(13) 



bi + 






Since Xq is irrational so is each x, and the subsequent = - 



1 



is always well- 



Xi - 

defined. (13) expresses Xq as a finite continued fraction but the last term jc, is not an 
integer. Nevertheless, as noted earlier, the formalism of (3), (4) continues and one has for 
all 7 > 1 



r, , , 1 + A-2 

= = — 



JC, B,_, 4- B -_2 



(14) 



where the , A,_2 , , 5,_2 depend only on the integers . 

We now define an infinite continued fraction to be the formal infinite expression 



bn + - 






■ = [60,^1,62,... where is any integer and ,Z?2 ,... an infinite sequence 



of positive integers. Thus we have now associated with every irrational jc an infinite 
continued fraction, denoted CF[x ] . Consider a numerical example. Take Xq= x = -V3 . 
1 1 



Then b^ =- 2 , x, 



^0 ^0 2 — ^3 



At this point it is best to proceed with elementary 
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1 2 + V3 r 

algebra and rationalize the denominator: jc, = ^ 2 + -^3 Leaving 

^/5 + l I- 

the simple steps to the reader we then have Xj = — - — , = 1 , ^3 = V3 + 1 , = 2 , 



^4 = — 2 — ’ ^4 ” L But wait - X4 = ^2 , Z?4 = ^2 so then = x^, = b^ and everything 

repeats after two steps. Doing it directly on a calculator, without rationalizing 
denominators, I find x, =3,732050807, JC 2 = 1.366025404, JC 3 =2.732050806, 



x^ = 1 .366025407, so ^2 ^ x ^ ! So we’ve found CF[- V3 ] = [- 2,3,1,2,1,2, . . .] = 

[- 2,3, 1 , 2 ] , where the bar over the block 1,2 indicates its infinite repetition. In general, a 

continued fraction is called periodic if it has the form [cq,c^ . The 

block is called the period and k>\ is the length of the period. Cq,c, ,...,C/,_, 

is the pre-period and may be absent, in which case the continued fraction is called purely 
periodic. Note that for CF[x] = [z?o,Z7, purely periodic, the 6, , a,, have period k: 

b^^f. = = X, for all i > 0 and Z?. = b^ , x. = x . whenever / = y(mod k) . It is tacitly 

assumed that the period is minimal, does not consist of a repeated shorter block, and 
starts ‘as soon as possible.’ Thus, though Cf[- V 3 ] can be written [-2,3,1, 2, 1,2, l] the 
period is 1,2 and the pre-period -2,3. Because of periodicity we can describe all the 
terms. Euler found, for e the base of the natural logarithm, 

CF[e\ = [ 2 , 1,2, 1,1,4, 1,1, 6, 1,1,8,!...] and though not periodic there is a pattern after the initial 
Z?o = 2 and the terms are ‘known.’ On the other hand CF[ti\ = [3,7,1 5,1,292,1,1,1,2...] and 

no one has found a pattern to predict the terms. Apparently CF[x] in not known in its 
entirety for any algebraic number of degree >2. 

So far infinite continued fractions are only formal objects - like infinite series 
before one has defined the notions of convergence and sum. Considering again (14) 



along with (10) we have x^ - — ^ 



+ ^,_2 ^/-l 

+ B,_2 5„, 



±1 

+ B,_2) 



. Thus 
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1 



1 



B,_,\ B„,(z?,,x, + 5 ,_,) B,_ 

A. 



- , since x, >1 for / > 1 . But B^ = \< < B^ < • 



shows -> 00, lim— ^ = Xq. Thus the ‘partial’ finite continued fractions 

/-.CO B,_, 

[^0,^, converge to a limit, the limit being Xq. Moreover it can be shown 

that given any sequence of integers bQ,b^,... with b^ positive for / > 1 , then the finite 
continued fractions ] tend to a limit as / -> oo , which is then defined to be 

the value of the infinite continued fraction. In fact, the function that assigns to each 
infinite continued fraction its value is a bijection onto the set of all irrational numbers; if 

the value of [z?o, Zip... ] isacthen CF[x] = [^o,Z?p...]. The rationals are called the 

convergents of x, and give ‘best’ rational approximations; their study is the topic of 
diophantine approximation. 

It was not just luck that Cf[- Vs] turned out to be periodic. The classical 

theorem of Lagrange asserts that CF[x] is p-eriodic if and only if x is a quadratic 
irrational - the root of a quadratic equation with rational coefficients. Furthermore, 
Galois showed that CF[x] is purely periodic if and only ifx is reduced. Reduced means 
X > 1 and — 1 < x' < 0, where x' is the conjugate of x, the other root of the equation 

satisfied by x. For example, from Cf[- Vs] we had X2 = ^ ^ =[ 1,2 ] and X2 > 1 , 



- 1 < Xj = 



-V3 + I 



< 0 . 



Let J denote the set of irrational numbers, Ij the quadratic irrationals and R the 
reduced ones. The x, generated by CF[x] are called the ‘complete quotients’ ofx. The 
construction shows that if CF[x] = [^o,6p^2 »• ••] ^l^^n CF[x,] = p/?. ^2 »•••]» 'which 



might be called a ‘tail’ of CF[x] . Recall the group G; A/ = I , I e G acts on / by 

\c a) 
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sending x into M(x) = 7 . The map G x / -> / that sends {M,x) to M(x) is a 

cx + a 

standard group action. Under it / breaks up into G-orbits or equivalence classes; for 
x,y el we say y - x if y = M{x) for some M eG . Thus (14) shows x, ~ x and Serret 
proved the beautiful result that describes the G-equivalence class of x in terms of 
continued fractions: ~ x if and only if for some integers ij’ > 0 , yj = x. , that is y, x 

have identical ‘tails’ from some point on. 

From now on we shall be concerned only with quadratic irrationals and our goal 
is to show how the periodic structure of CF[x] reflects arithmetic properties. Numbers 
X G I 2 are classified by their discriminant. This word requires careful definition since it 
is used in different ways in various contexts, x is the root of a quadratic equation with 
rational coefficients. By multiplying, dividing the equation by integers one can arrange it 
to be in the form aX^ +bX + c = 0 with a, b, c integers, a>0 and gcd(a,b,c)=\; X is 
simply an indeterminate. This equation is now uniquely determined and we call it the 
standard equation for x. Then d = b^ - 4ac is called the discriminant of x, denoted 
i/=disc(x). For example, with x = -V3 , the standard eq aation is - 3 = 0 and d=\2. 
Note that any integer d arriving as disc(x) is positive, since x is real, not a (perfect) 
square, since x is irrational and d = b^ ^0 or l(mod 4). We call any positive integer d 
not a square and = 0 or 1 (mod 4) a discriminant. In fact, c/=disc(x) where x is a root of 

X^ = 0, if d = 0(mod4), and a root of X^ - X = 0, if d = l(mod4) . 

The first few discriminants are 5, 8, 12, 13, 17, 20, 21. Note, for later use, that if the 
discriminant d = D , s, D integers and Z) s 0 or l(mod 4), then D is also a discriminant. 

Consider ^ ^ ^ ~ ^ standard equation for x 

and c/=disc(x), then ±{cX^ +bX + a) = 0 is the equation fory', so disc(y)=t7. if 
(x) , m an integer, then x-z + m satisfies 




z-x~m- 
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a{z + m)^ + b{z + m) + c = 0 from which one deduces that the standard equation for z is 
+ BX + C = 0 where A=a, B=lam-^b, C=am^+bm+c whence 

disc(z)= B^ -4 AC = b^ - 4ac = d . Easier to see is that - x = ^ ^ ^{x) has disc(-x)=^f. 



1 6V0 1 



0 lAi oj u a 



and the results of Section 2 enable one to show that any 



(b \] f - 1 C] (0 n . 

M E.G may be expressed as a product of matrices of the type , ^ ^ i i 

VI 0/ V u ly VI Uy 



follows that if y = M(x) , M eG , then disc(y)= disc(x). Thus disc(jc) is an invariant of 



the G-equivalence class of ;c. We also see directly from jc = Xq = f — , that disc(x)= 

X, 

disc( ;c, ) and in general all the complete quotients jc, of x have the same discriminant. 

Thus we’ve shown that for x,y e I 2 , y -- x implies disc(y)= disc(jc). However, 
the converse is false, as we shall see. Given a discriminant d, let I^id) be the set of 
X e 1 2 having disc(x)=^/ and P{d) the set of reduced x having disc(jc)=(i. By what we 
have just shown, l 2 {d) is a union of G-orbits. By Lagrange’s theorem every x g /2 has 
some x^ which is purely periodic, hence in R by Galois, thus every x e Ijid) is G- 
equivalent to some y e R{d ) . In other words every G-orbit in l 2 {d) meets R(d ) . The 
number of G-orbits in l 2 {d) is called the class number of d, denoted h = h{d) and we 
will show h{d) is finite by showing R{d) is finite. But more than showing R{d) finite, 
we want to give an easy algorithm for listing the members of R{d) . 

Theorem Let be a discriminant. The set R{d) is finite and its members 
may be enumerated as follows. 

Let g be that one of \4d J , J - 1 that \s =d (mod 2). For 
p = g,g - 2,g - 4,... and >0 (so the last p is 2 or 1 according as d is even or odd) 
determine all ordered triples {p,q,q*) of positive integers satisfying 

q, q"^ are even, d - p^ +qq* , g - p + 2 < q < g + p and gcd{j q,p,jq*) = 1 . 



(15) 
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Assign to {p,q^q*) the number x = — — . Then x e R{d) and in this way 

each member of R{d) is listed exactly once. By this arrangement the standard equation 
iov x\s (jq)x^ - pX ~{jq*) Q . 

Proof Suppose x e R{d) and has standard equation aX^ + bX, + c = 0 . 

-b-^yfd -b-4d 



By the quadratic formula the roots are 



and the former is the larger 



2a ’ la 

root - by our standardization a>0. But x reduced says x>l, - l<jc' <0 so .x is the larger 

— b-¥ yfd c b 

root, X = — . Also 0 > xx' = — so c<0 and - — = x + x' >1-1 = 0 so ^<0. 

2a a a 

Setting p = -b , q = 2a , q* = -2c we have p,q,q* positive integers, q,q* even and 

“^J~d + p 

d = p^ +qq* . With this notation x = (we write the yfd first to emphasize the 

q 

role of the discriminant) and the standard equation is - pX - (i^*) = 0 . We now 

reverre direction and ask, given p,q,q* po.sitive integers, with q,q* even, d = p^ +qq* 

and god(jq,p,jq*) = 1 will x = — — e RidY? Note thatx is indeed a root of the 

q 

standard equation {\q)X^ - pX - (i^*) = 0 and has discriminant d, so we only need to 

j — — d “ 1 ” p 

check if x is reduced. Now x>l is equivalent to ^ < dd + p , x' = < 0 is 

q 

equivalent to /? < ^fd and x' > -1 is equivalent to yfd - p <q . Combining these we see 
that 



Q<-Jd-p<q<yfd+p (16) 

must be satisfied. We now want to eliminate the yfd . Since d = p^ +qq\ q,q* even, we 
have p = c/(mod 2) and 0< p < yfd , so that p is < the largest integer that is < ^fd and 
= c/(mod 2) , which is precisely g as defined in the theorem. Thus the possible p are as 
described. By definition of g, -Jd ;= g + ,9 with 0 < i9 < 2 and (16) is equivalent to 
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g-p+&<q<g-\-p + &. Noting that g-p, q, g^p are all even this is equivalent to 
g- p + 2<q<g-^ p . 

Our description shows that the conditions of (15) are precisely what is needed to 



yfd + p 

guarantee x = 



e R(d) . Since there are only finitely many values of p and for 



each p only finitely many values oiq,q^ this shows R{d) finite and the proof is 
complete. 

Note that qq* = d - p^ = (^\fd - p\-Jd + p) along with ( 1 6) shows that also 

4d - p <q* < yfd + p so our conditions are symmetric with respect to q,q*. Along with 

yfd + p . . , yfd -f p 1 .1 

X = in R(d) we also have ; — = e R(d) . If we set x = , then 



X -> X* is an involution of R(d) . 

Note from (15) that q-2 can occur only when g=p. Since d = g^ -{■ 2q* 
determines the even integer q* uniquely and the gcd condition is obviously satisfied, we 

see that , which we denote as z or z{d) is the unique member of R{d) with q=2,. 

Suppose in listing members of R{d) according to the theorem one has p,q,q* satisfying 

all conditions of (15) except gcd(y^,p,y^*) = j > 1 . Then q-sQ, p = sP, q* -sQ\ 

with Q, even and D= + QQ * , d - s^D so D is a discriminant and P, Q, Q* satisfy 

(15) for D. The inequalities in (1 5) are not so obvious though, but the proof showed they 

are equivalent to (16) and the terms there are homogeneous in s, so we obtain 

I — ! — 'yj d "I" Z7 ■%/ jf) + P 

0<yjD-P<Q<yiD-{-P. Thus — - — = — — — e R{D) . Asa consequence, if d 

cannot be written as , 5>1, Z) = 0 or l(mod4) the gcd condition of (15) needn’t be 
checked since it always holds. Such d are called fundamental (or field) discriminants; the 
first few are 5, 8, 12, 13, 17, 21. 
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It’s worth remarking tliat when presented with an x e disc(x) is often not 

^/7+2 -V 7+2 

obvious. For example, x = — - — > 1 and - 1 < jc' = < 0, so x e R . What is 

disc(jc)? Find the standard equation: (4x - 2)^ = 7 , 1 6jc^ -16x-3 = 0, d = 448 , 

V^ + 16 

X = — € /?(448) . From now on we always write a reduced x in the form 

yfd -f p 

described in the theorem, x = , ^/=disc(x), and call this the standard form of x. 

For d a discriminant we denote |/?(^/)|, the number of members of R(d) , by r(d ) . Since 
z{d) e R(d) , r{d) > 1 . We consider some numerical examples. 



d=31 is a prime, so the gcd condition need not be checked. J = ^ 
the range of/7 is/7=5, 3, 1 . 

/7=5 Solve qq* =37-5^ =12 with q,q* even and 2 < ^ < 10 . There are two 
solutions, 12 = 2*6 = 6*2. 

/7=3 Solve qq* =37-3^ =28 with q,q* even and 4 < g < 8 . There are no 
solutions. 



p=\ Solve qq* =37-1^ =36 with q,q^ even and 6<q <6 . There is only one 
solution, 36 = 6 * 6 . Thus r(37) = 3 , 

V37+1I 



I V^ + 5 ^/37+5 

R{-il) = \z = ,x = ,y- 



. Note that x = z* and y = y* . 



d=12 (not a fundamental discriminant) = 8 , g = 8 ,p=8, 6, 4, 2. 

/7=8 qq* =72-8^ = 8, 2<^<16, has solutions, 8 = 2*4=4*2, 
gcd(i(2),8,i(4)) = l. 
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p=6 qq' =72-6’ =36, A<q<\A, has one solution, 36 = 6*6, but 
gcd(^(6),6,y(6)) = 3 ; reject. 

p=4 qq* =12- =56, 6 < ^ < 12 , no solutions. 

p^2 qq* =12-2^ = 68, 8<^<10,no solutions. Thus r(72) = 2 , 




The function r{d) is extremely erratic. The reader may wish to calculate r(28l) 
and r(293) . For larger d one has r(l 109) = 1 1, r(l 129) = 51 , r(l 181) = 9, r(l20l) = 53 ; 
all these discriminants are prime. These results were not obtained by hand but by a 
computer program implementing the algorithm of the theorem. I want to thank my 
Lehman College student Mr. E. Moss for his assistance with the computer work. 

If ;t e R(d ) , CF[xj = is purely periodic, x^=x, x,,...jc are all 

and in R{d) . Suppose also e /?((7) , CF[>^] = [co,c, ,...,c,„_j ] . If jv ~ jc , by Serret’s 
theorem yj = x, , for some jj > 0 . But then for any « > 0 , y so choose n so 

that j + n = sm, 2 i multiple of m. By periodicity = >^ , so y = , and reducing 

/ 4- n mod k , say i + n = ^(mod k) , 0 < I < k - \ , jc,^„ = Xf,ox y = Xf is one of the k 

distinct complete quotients of x. Thus from CF[x] = ,...,6^_,] we deduce that the 

G-orbit of X meets R{d) in the set {xq = x,x, ,...x^_, } . We refer to this as the cycle of x, 
since it comes with a specific ordering determined by CF[x] . If one had started with 
some X,, 0</<A:-l, CF[x,] = and the cycle of x. is 

|x,,x,^,,...,x^_,,Xo = x,...,x,_j I . Thus the ordering is determined uniquely up to a cyclic 
permutation. If one then partitions R{d) into cycles, the number of cycles is precisely 
h{d ) , the class number of <7, defined earlier as the number of G-orbits of l 2 {d ) . 
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For example, starting with z = z,, = 



VJ7+5 



obtained above one finds 



CF[z(,] = [ 5 , 1 , 1 ], Z, =A-, Zj = y , 'm the notation of that example, [z^x^y] is the only 

cycle and h{37) = 1 . Observe that the period 5, 1, 1 after the first term is a palindrome, 1, 
1 , which reads the same backwards as forwards. This phenomenon always occurs for 

CF[z] for any d. CF[y] = [l,5,l] has the whole period as a palindrome, and y = — ^ , 



37 = -I- 6T Again this will always happen when q - q\ d - . In the literature 

these observations are usually made in connection with the study of Cf[V«], n not a 
square. We have examined these special symmetries in a recent paper [3]. 



The function h{d) is also quite erratic. Something is known concerning its 
divisibility by powers of 2 from Gauss’ genus theory of binary quadratic forms. Gauss 
also conjectured that h{d) = 1 occurs infinitely often and this still remains an open 
question. For the d mentioned earlier, h{\ 109 ) = 1 , h[\ 129 ) = 9 , /i(l I 8 I) = 1 , 

M12OI) = 1 . 

4. Quadratic Fields Significant aspects of purely periodic continued fractions 

can only be understood in the context of algebraic number theory. Actually, all that is 
needed are some basic notions of that theory as they pertain to real quadratic fields. We 
shall take these for granted, and recall a few points to fix the ideas. 



Let be a real quadratic field, the ring of algebraic integers in K. If x,y e K 
are linearly independent over the rationals they fonn a basis for K/Q; every X e K has a 
unique representation as X = rx + sy , r,s eQ . If we restrict ourselves to those X for 
which r,s eZ the resulting set L is a subgroup of the additive group of K and is a free 
abelian group of rank 2. We write L = {x,y) to indicate the construction of L via the 
basis xy^ for L/Z. L is called a (full) module in K. An order in is a full module that is 
also a subring of K and contains the ring Z. (This has nothing to do with the notion of 
order in the sense of ‘less than’ or ‘greater than.’) If L is a full module the set 
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{^eK\'^LciL) is called the coefficient ring L. Here is the set of all elements X 
ranging over L. The coefficient ring of L is in fact an order. 

Every K is obtained uniquely (up to isomorphism) as K = Q{J^) where m is an 
integer>l and square free. Set 



1 + Vm ,ifm=l(mod4) (17) 



= i 



2 



[Vm ’ if m = 2,3(mod4) 

Then 1, co is a basis for K/ Q and = (l,co) . Every order O is contained in 

Of. and is uniquely detemiined by its index / = and then O = (l,/co). For 

X e K , irrational, x' ^ x and x' eK . Defining x' = x for x eQ, the map x — > x' is the 
non-trivial automorphism of K! Q \ '\\ and the identity map constitute the Galois Group of 



K! Q . The discriminant of O, disc(O) is defined to be 



X 

x' 



y 

y 



= {xy'-x'yy 



where x, 



y is a basis for O. It is independent of the choice of basis. Using the basis 1 , fix) one 
obtains disc(0)= /^(ca - co')^ • By (17) then 



y- 2 ^ , if /77 = l(mod4) (18) 



disc(O) = 



[4/'m ’ 777 = 2,3(mod4). 

Thus disc(0)=t/, a positive integer, not a perfect square; m is the square free part 
of d. For/=l , O = Of^ .d 'ismor 4m, according as m = 1 (mod 4) or not. This also is 
called the discriminant of K, denoted df . . The numbers d,^ are the fundamental or field 
discriminants mentioned earlier. 

It should now be clear that if we allow O to range over all orders of all quadratic 
fields the resulting set of integers {disc(O)} is precisely the set of positive integers not 
perfect squares that are = 0 or l(mod 4), i.e. the set of numbers called ‘discriminants’ in 
the previous section arising as disc(x)= -4ac . Also, given d the numbers m,f are 
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uniquely recoverable, showing that there is a unique O with disc(0)=r/. We denote by 
Oj the order with discriminant d. For example, what is O, 2 ooo*^ 

Write 1 2000 = 2^* • 5^ • 2 • 3 • 5 , so m=30 is the square free part. 30s2(mod 4) so 
write 12000 = 4(2*5)^ -30 showing/=10. Thus O, 2 ooo = (l,10V^) contained in 



Since O is determined by its discriminant (we find) it is preferable to express the 
elements of Oj ‘canonically’ in tenns of d, rather than use the somewhat arbitrary basis 
l,yo}. We claim 



r -f s4d 

X G if and only if X = , r,5GZand ;=5<i(mod 2) 



(19) 



Note the congruence condition, so this is not a basis representation. 

To prove the claim, suppose m =l(mod 4). Then d = f^m ^ p ^ f (mod 2) and 

f f Ptii f + Pd ( 2 ^ + sf ) + spd 

fis) ~ = . Hence X iff X- t + sfid = with 



r + syfcJ 

or iff X = with r,seZ and r ^sf =.y<i(mod 2). The cases m =2,3 (mod 4) are 



proved similarly. Note that with X = g Oj then iVA, the norm of A, 



= XX^ = 




and this is an integer. Also we see directly that X g Oj iff X' eOj. 



The units in K are the algebraic integers rfor which — is also an algebraic 

T 

integer. Thus the units are the invertible elements in and form a group a 
subgroup of AT"' , the multiplicative group of non-zero elements of K. re O^^ is a unit iff 
Nr= T t’= ± 1 which is iff its standard equation has the form -\-bX ±1 = 0. 

If Oj is an order in K and re Oj n then — = ±x' shows that r is an invertible 

element of Oj . Thus it follows that Oj r\ is the group of invertible elements in Oj , 

denoted U j , called the unit group for discriminant d. By (1 9) we have that 
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r + s4d 

T = e LJ . 



iff r,s satisfy the Pell equation - ds^ - ±4 . Now it is not at all 



obvious that this equation has non-trivial solutions, i.e. other than r = ±2 , j = 0 , so not at 
all obvious that there are any units r other than the trivial ones ± 1 , which are the units 
for the rational field Q with the ring of integers Z. What we want to show here is that the 
existence of non-trivial units follows as a direct consequence of results about continued 
fractions, and the two theories shed light on each other. 

Let X eK be irrational, L the full module (x,l) . x, 1 is a basis for K/Q so 
multiplication by e A!' is represented by 

>.x = (xx + pl = oaf-+-p (20) 

X, = X*l = yx4-5'l = yA:-i-5 

( a 

with a, p, y, 5 rational. As long as the matrix M - 

vy 

M G GL(2,g) . We denote M = p(A.) and then the map p: K"" 

homomorphism - the proof makes use of the commutativity of /G . Recall that 
AA=detp(A) . p really should be denoted p^ , to show its dependence on the basis x,l. 



Pi 

IS invertible; 
oj 



GL(2,(2) is an injective 



Let aX^ +bX + c = Q be the standard equation for x and d=disc(x). Let O be the 
coefficient ring of L; by(20), XeO iff a,p,y ,6 g Z . Thus for XeO, X = yx + d and 



(yx + d)x = ox + p , or yjc^ + (6 - a)x - p = 0 . Comparing with the standard equation this 



forces y = na for some integer n, X = n{ax) + 5 g (ojc, 1) . Conversely, one sees that 

X g(ox,1) implies XeO. Thus 0 = (ajc,l) and 

± yfd 

a 



disc(G)= 



ax 1 
\ax' 1 



= a^{x-x'Y = a' 



= d . This proves the crucial connection 



if disc(x)=c/ then Oj is the coefficient ring of (x,l) . 



( 21 ) 



Let Q be the set of matrices A/ = g GZ,(2,<2) having a, p,y,5 gZ. Fora 



non-zero integer n let Q„ = { M g Q | dti{M)=n }; so our group G = Q, uQ_, . Suppose 
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/X fa 

now 6 O , as above, NX~n and p(>l) = Af = . Then M e Q„ and 

Xjc ouc + P 

X = — = — = Mix ) ; X is fixed under the action of A/ as a Mobius transformation. 

X yx + 5 ^ 

f a P^ cue + p 

Conversely, suppose M = e Q„ and x - M{x) = . Set A = yx + 6 , then 

Xx = (yx + 5)x = cor + p so XeO, A/ = p(X). If we make the definition A „={XeO | 
N?^=nj, and for any set of matrices S cz GL(2,Q) , S^={MeS | Af(x)=x} then we have 
shown that p maps A„ one-one onto . In particular with (/ = = A, u A_, , p 

maps f/bijectively onto Q, , u Q_,^ = . Since p is also a homomorphism we have 

that p: U -^G^ is an isomorphism. The inverse isomorphism we denote by (p:G^ -> f/ ; 
a 



for Af = I ' I e , cp( M) = yx + 5 eU . Thus if the group structure of G^ is 
^y oj 

determined then so is that of U. 

So far the only property of x that was used is that disc(x)=^/, so we are free to 
choose any X subject to this condition. We choose xeR{d), CF[x] = . 

For;>l set M„, = m[^)o, 6 , • By(14), x= With /=*, 

set A^ = Af;^., = m[ 6 o, 6 , and x^=x^ = x we have x = Af (x) , MeG^, Since 
A/ e G"" , it has infinite order and this proves the existence of non-trivial units. Let 
r = {± /} X { Af" = {± Af"} . Clearly T c G^ and we now show that actually F = G^ . 

Suppose V eG^, V ^ ±I . Let 9 (F) = ^ l . Then cp(± F) = , cp(± F"‘) = ±— and one 

of these is >1 . Let IF be that one of ± F , ± F~‘ for which (p(lF) > 1 . It will suffice now 

(a p') 

to show W eV . Suppose ^ gj ; set r) = (p(lF) = yx + 5 , p > 1 and v\eU . Thus 



r|' = ± — , |r|'| < 1 , then 0 < r| - ly = y(x-x') shows y > 0 (recall x here is reduced). 
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Also 6 = 1 ]'- yx' = ri' + |yx'| > r|' > -1 shows 6 > 0 . Also - \ < x' implies 
-y+6<yx' + 5 = r|'<l, hence -y+5<0, 5<y. Altogether then 0 < 5 < y . 
We now consider various cases. 

If 0 < 6 < y then by the lemma of section 2, setting a = det IT , 



CF 



= we have W = M[cQ,c^,...,c,,_^ . Again using (14), the value of 



the continued fraction = JV(x) and IV(x) = x , since fV eG^ . Thus 

X = [c„ , c, , . . . , c„_ , , x] , which says [6„ , 6, , . . . , 6* _, ] = [co , c, , . . . , c„_, . Since the 

continued fraction of an irrational is unique this forces Co,c, to be a y-fold 

repetition of the period block for some J>1. Thus W = M' eT . 

Remaining are the cases 0 = 6 < y = 1 and 0 < 5 = y = 1 . Rather than consider 
these directly it is easier to consider separately the two cases (a) Nv[ = -1 (b) Nv[ - 1 . For 

(a) Ar| = -1 , ly = - — < 0 . Our previous analysis showed - y + 5 < r|' hence now 



- y + 5 < 0 , 6 < y so we must have 6 = 0, y = 1 , W = 



a 

A 



P' 



0 )‘ 



But X] = (p(fV ) , 



p(r|) = fV, det = -1 so P = 1 . Thus ^ j q j ’ ^ ^ = a + — , hence 

a = [_xj , CF[x] = [a] . Hence k = \, bQ=a, W=M. 

For (b), q' = — > 0 . We saw previously that 6 > q' , hence now 6 > 0, so we 
(a p^ 

must have 6 = y = l,IT = l I. But det W = Nr\ = 1 then implies a - p = 1 , 

f P + 1 P^ X 1 

^ ~ V 1 I J ’ ^o a: = W{x) = P + — = P + I . Thus P = \_x] and 

Cqx] = [p,l,p,l,...]. If p>l then CF[a:] = [P],so A: = 2 ; = P, 6, = 1 , 
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M = M[1] = ^ and ^ ^ J ~ completes the proof that = T . 

Now U = cp(G^;r) = {± where e = (p(M) >1.8 clearly is the smallest unit >1 

and is called the fundamental unit for the discriminant d. We now write using (19), and 

t -I- uyfd 



keep this notation throughout, e = - 



t,u eZ , t = ud(mod2) . 



To summarize: to find s choose any x e R{d) - there is always at least 
: = ^ at hand - determine CF[x] = [z?o,Z7j,...,Z?^_j and set M = 



A-i A-A , X V^ + 5 

. Then e = (p(M) = . For example, with d=37, z = , 

^k-\ ^k-2-' 2 



we had (in the previous section) CF[z] = [5,l,l] . Thus M = 

I2 + 2V37 



11 6' 
2 1, 



E = 2z + 1 = 



- , /=12, u=2. Returning to the general case, e = 9(M) is 



in standard 



equivalent to M = p(e) which means that once e is known, M can be calculated as the 

+ p . 

matrix representing multiplication by e on the basis v,l . Write x = — 
form. Then straightforward computation shows 



provided one recalls d - • Thus 



A-\ ^k-2 

A-i K-2 



= M = p(e) = 



A + pu 1 * ^ 

"T" 2^“ 

t-pu 



V 2 



qu 



( 22 ) 



2 2 
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gives two independent presentations of M\ the one on the left depends only on 
CF[x] = right depends on knowing 8 . This gives a new 

way of calculating CF[x] , purely rationally, once 8 is known. Namely, since M eG'' , 
by our theorem of section 2 it has a unique representation as m[6o,^, found by 

r (/ + pu) / 2 



computing CF^ 



where a = det M = A^8 . Thus the period of CF[x\ is found 



quH 

by computing the continued fraction of a rational number. We state this as a theorem. 

t + -Jd p 

Theorem If 8 = is known, then for x = g R\d ) , CF[x\ may 



be found as follows. Let a = = ±1 , and compute CF^ 



\t + pu)l2 
qu / 2 



~ [^0 • 



Then CF[x] = 

Note from (22) and (10) that (- 1)‘ = det M = Nz so that the parity of k, the 
length of th'. period of CF[x] , depends only on d, not the specific a: g R{d ) . k is odd for 
all X G R{d) iff A^8 = -1 and is even for all x g l:{d) iff = 1 . 

Here is a numerical example. For ^/=1009, z = usual method 

of computation, CF[z\ = [ 3 1,2, 1,1, 1,1, 2 ] , k=l . Then by the method described in the table 

... - , (A ^3^ fl067 408') 1080 + 34V1009 

(6) one finds = 34z-f 13 = , /=1080, 

xr n ^ VIOO 9+25 

w=34, A^ 8 =- 1 . Consider now a: = 77 ; e /?(1009) ,/?=25, ^=12, ^ =32. Now, by 



12 



the theorem, we calculate CF_ 
CF[x] = [4,l,2,l,2,2,3,l,l' . 



965 

204 



: [4,l,2,l,2,2,3,l,l] and so 



For certain d, z is known a priori. Suppose = «^ + 4 , n>\ . Then 



n + ^ . 



unit in and is >1 . It must be the fundamental unit e , for otherwise it is z ' for some 
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j> 1 and then the coefficient of 4d would be greater than 1 . Thus e = 



n + 






, t=n, u=\ 



and Nz = -\. Note that in this case z{d) = s and since 8^-«e-l = 0, s = /7 + — shows 

8 

CF[z] = [/?] . This is, in fact, the only time the period has length 1 . By the theorem we 

yjd + p 



have that for x = - 



■ e R(d ) , CF[x] is found from CF_ 



{n + p)/ 2 
qll 



. For example 



904 = 30^ + 4 , X = is reduced and CF_ 



= [ 9 , 2 , 1 ] so CF[x] = [ 9 , 2 , 1 ] . 



Other cases where e is immediately known are: 



, 2n + 2yfd 

d = n n even >4,8 = , A^8 = -1 , 

, n + yfd 

d - - A ,n >A , 8 = — - — , A^8 = 1 , 



, 2n + 2y[d 

d - - \ ,n odd >5,8 = , A^8 = 1 . 



It is interesting to note that our determination of gives a new criterion for 
jc G : An irrational number x is a reduced quadratic irrational if and only if x is positive 
and iS'(A') = X for some matrix S . 

We have already seen that the condition is necessary. On the other hand, suppose 
5’(x) = X , S eG* . Then by the Theorem of Section 2, 5* = m[^o , for a 

uniquely detemiined sequence of positive integers. There is a unique, largest integer 
7>1 such that is ay-fold repetition of the initial /: terms , 

;? = jk . Let >^ = [z?o,/?| ]. Then, yeR and G,, is generated by 

M = m[^o,/?i But 5 = so S fixes y and y' . Since S has only two fixed 
points X = y or y' . But x>0, so x = y ,x is reduced. 
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5. G mod 2 Let F = Z mod 2 , the field with two elements 0, 1 . We use the 
usual integer symbols, so one should note, from the context, when the arithmetic is being 
done mod 2. In this section all congruences a = b are to be understood mod 2 unless 
indicated otherwise. The map n-> n mod 2 of Z onto F induces a homomorphism 



^:G-^GL[2,F)'^y 



b\\ (a modi Z)mod2^ 



, ^ . For M eG v/Q denote ^(M) by 

: mod 2 a mod 2J 



\\c dJJ \ci 

M , and GL{2, F) by G . One easily sees that G is a group of order 6 its elements being 

•""0 I )-*-!? !]• 



/ = 1^0 I I ’ identity element. 






1 0 
1 1 . 



have order 3, R,UR,U^R have order 2 and UR = RU^ . Thus 



G = 5*3 , the symmetric group of degree 3. U,U^ correspond to 3 -cycles and R,UR,U^R 
to transpositions. For M eG, S eG we also say M = S rather than M = S . For 



7 5^ f21 8' 

example 111 gJ-^’U 3 , 



the conjugacy classes of G . 



= /. 2, =(/}, ={U,U^}, Z, = {r,URM'R} are 



t + u-sfd 

Theorem Let dbtdL discriminant, e = ^ , / = , the fundamental unit. Let 

xe^R{d), CF[x]= [^ 0 ,^, , M = Then the conjugacy class of 

M eG depends only on d, as follows: 

(a) d = l(mod 8) implies M = J , 

(b) d = 5(mod8) implies M = 1 if w is even, and M eZ^ if w is odd. 

(c) d even implies M = J if w is even, and M eZ^^ if w is odd. 



Proof As usual, write x = ^ ^ ^ and M = 



^ t + pu 1 . ^ 
— - — —q u 
2 2 ^ 

1 t - pu 

— 



as in (22). If u is 



even, then both off-diagonal elements are 0 mod 2 and the only matrix in G with this 
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property is/; thus M - 1 whenever u is even. U d ^ l(mod8), -du^ = ±4 implies 
-u^ = 4(mod 8) . But if t,u are odd, = 1 - 1 = 0(mod 8) , hence w, and /, must be 

even. This proves (a) and the case u even in (b),(c). Now assume u odd. If 

d = 5(mod8) ^ d - +qq ,/? odd, so qq* = 5 - 1 = 4(mod8) . Thus ]^q , ^q* are both 
odd and the off-diagonal elements of Mare odd. Also, t = du = \ so trace(M)=/, is odd. 



, 1 1 

So one of the diagonal entries of M is even and one is odd. Thus -^ = 1^1 oj ^ ^ 



fO f, 

M= = // , i.e. M el> 2 ' If is even, then so is p, and gcd 

\1 I 






implies at least one of —q, — q* is odd. Now trace(M)= t = du = 0 so the diagonal entries 



are both even or both odd. But this characterizes M el ^ and the proof is done. 



Note that we have M = I if and only if u is even, Now G being isomorphic to 
iS'3 has a homomorphism G — > {±1} defined by assigning to each S eG its sign as a 
pemiutation. Thus 5 — > -1 if g E3 and .S' 1 otherwise. Composing with ^:G G 

we obtain a homomorphism, or character, x » lo {- 0 • j(M) = -1 for M e Z3 and 
^(M) = l for MeZ,ul 2 - 



Keeping with the notation of the theorem, we have 



(bo lY^ n (b,_, n (b n 

M = \ . ^ _... _. But = Rot U according as b is even or odd, so 

1 OA 1 Oy V 1 Oy \1 Oy 



Z?, 1, , 

Xl I 1 I I = -1 or 1 according as b^ is even or odd. Since x is a homomorphism 



x( M) = (— 1)‘' where is ^ = e(x) is the number of terms b^ in the period of CF\pc\ that are 
even. Combining with the theorem we thus have: e odd iff ;^(M) = -1 iff M el ^ iff d 
even and u odd. Thus d odd implies e even and d even implies e odd or even according 
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as u is odd or even. But this is precisely the theorem stated in the Introduction, as 
promised. 

Note that for d odd the result in terms of e is weaker than the result of the theorem 
of this section, since e does not differentiate between Z, and Zj . For d odd we always 
have e even and M = I if w even and M = U ox if u odd. 

We conclude with some numerical examples. 



(1 ) At the end of the previous section we had for d = 1009 = l(mod 8) , 



1080 + 34V1009 



, Nz = -\, w = 34 even. We found 



(a) CF\ 



vro^-f3i 

2 



•3 1,2,1,1,1,1,2] , /f=7, c=2, M = URU^R = I. 



’Vl009 +25] r 1 , 

(b) CF = [4,l,2,l,2,2,3,l,lJ , k=9, e=4, M = RURURHJ^ = 7 , all as 

predicted. 

(2) For d = 904 = 30^ + 4 , even, we saw CF — = [30], k=\, e=l, 

30 + 

8 “ 2 ’ ^ ^ ^ ^ M = R . Also 

"V^ + 26l r 1 

CF ^ =[9,2,lj,/^3,e=l, M ^ URU ^ R . 

(3) Consider x = [l0,2,lj; what can be said about c7=disc(x)? Heree=2 does not tell us 
too much but M = R^U = ^7 , so we know d = 5(mod 8) and u is odd. Also k=3, so 

f31 2\] 

Ns = ~\. Calculating M =\ ^ J , one can now find x by solving the quadratic 

equation M{x) = x . But it’s easier to proceed using (22) which gives 
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^ t + pu 1 . ^ 

—q u 

2 2 ^ 

1 i - pu 

V 2 ^" 2 2 



31 21 



^ . Since w = gcd|^-^w,pw ^ ^ 



/ + / 7 W / - / 7 l/ 



2 2’2 






gcd(3,29,2l)= 1 we have q=6, q*=42,p-29, d = p^ +qq* = 1093 = 5(mod8) and 

Vl^ + 29 j 33 + Vl^ 

_ /=3 H2=33 and z = . 

6 2 

(4) In the same way consider a: = [2,2,2,l,l,l,l,l] ; /:=8, Nz = \ . Since c=3 we have d even 

ri2i i5\ 



and u odd; M ^ = UR . We find M = 



V50 31 



i/ = gcd(50,90,75) = 5, 



ry=20, ^*=30,/7=18, ^/ = 18' + 20 x 30 = 924 , x = 

152 + 5V^ 

/= 121+31 = 152 , 8 = ; . 



V^ + 18 
20 



Finally 
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Abstract 

Let A be a set of integers. For every integer n, let r a, 2(71) denote the 
number of representations of n in the form n = ai + U2, where ai , a2 € A 
and ai <02- The function • Z — >• No U { 00 } is the representation 
function of order 2 for A. The set A is called an asymptotic basis of 
order 2 if is finite, that is, if every integer with at most a finite 

number of exceptions can be represented as the sum of two not necessarily 
distinct elements of A. It is proved that every function is a representation 
function, that is, if / : Z ^ No U { 00 } is any function such that /“^(O) 
is finite, then there exists a set A of integers such that f{n) = rA, 2 { 7 i) for 
all n G Z. Moreover, the set A can be constructed so that card{a G A : 
|a| < a;} ^ 



1 Representation functions 

Let N, No, and Z denote the positive integers, nonnegative integers, and inte- 
gers, respectively. Let A and B be sets of integers. We define the sumset 

A B — b d ^ A and ^ G .0}, 



and, in particular. 



2A = A A — {o-i + (3-2 ’ di, d2 G A^ 



and 

A b' = A -\- {^} — {^d 4 - 6 : (I G 
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The restricted sumsets are 

A^B = {a b : a E A, b € B , and a ^ b} 



and 



2 A A = A+A = {ai 4-^2 ^^1,02 ^ A and ai ^ a2}. 
Similarly, we define the difference set 

A — B = {a — b:a£A and b G B} 



and 

-A = { 0 } - A = {-a : -a G A}. 

We introduce the counting function 

A{y,x)= 1 . 

aeA 

yKaKx 

Thus, A{—x^x) counts the number of elements a ^ A such that \a\ < x. 

For functions / and g, we write / ^ if there exist numbers cq and xq 

such that |/(x)| > co\g{x)\ for all x > xq , and / <C ^ if \f{x)\ < co\g{x)\ for all 

X > Xq. 

In this paper we study representation functions of sets of integers. For any 
set A C Z, the representation function rA,2{'f^) counts the number of ways to 
write n in the form n = ai + a2 j where ai , a2 € A and a\ < U2 . The set 
A is called an asymptotic basis of order 2 if all but finitely many integers can 
be represented as the sum of two not necessarily distinct elements of A, or, 
equivalently, if the function 



'f' A,2 • Z — > No U {00} 



satisfies 

card(r^ 2(0)) < 00. 

Similarly, the restricted representation function rA,2{'^) counts the number 
of ways to write n in the form n — a\ + U2, where ai, tt2 G A and a\ <02. The 
set A is called a restricted asymptotic basis of order 2 if all but finitely many 
integers can be represented as the sum of two distinct elements of A. 

Let 

/:Z^NoU{oo} ( 1 ) 

be any function such that 

card(/“^( 0 )) < 00. ( 2 ) 

The inverse problem for representation functions of order 2 is to find sets A such 
that rA,2{n) = f(n) for all n e Z. Nathanson [ 4 ] proved that every function / 
satisfying ( 1 ) and ( 2 ) is the representation function of an asymptotic basis of 
order 2 , and that such bases A can be arbitrarily thin in the sense that the 
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counting functions A{—x,x) tend arbitrarily slowly to infinity. It remained an 
open problem to construct thick asymptotic bases of order 2 for the integers 
with a prescribed representation function. 

In the special case of the function /(n) = 1 for all integers n, Nathanson [6] 
constructed a unique representation basis, that is, a set A of integers with 
= 1 for all n G Z, with the additional property that A{—x^x) ^ logx. 
He posed the problem of constructing a unique representation bcisis A such that 
A(— X, x) ^ x^ for some a > 0. 

In this paper we prove that for every function / satisfying (1) and (2) there 
exist uncountably many asymptotic bases A of order 2 such that rA, 2 {'^) — /(^) 
for all n G Z, and A(—x, x) ^ It is not known if there exists a real number 
(5 > 0 such that one can solve the inverse problem for arbitrary functions / 
satisfying (1) and (2) with A{—x,x) ^ 

2 The Erdos- Turan conjecture 

The set A of nonnegative integers is an asymptotic basis of order 2 for No if 
the sumset 2 A contains all sufficently large integers. If A is a set of nonnegative 
integers, then 

0 < f'Aaip) < oo 

for every n G Nq. It is not true, however, that if 

/ : No ^ No 

is a function with 

card (/~^(0)) < oo, 

then there must exist a set A of nonnegative integers such that = /(^) 

for all n G No. For example, Dirac [1] proved that the representation function 
of an asymptotic basis of order 2 cannot be eventually constant, and Erdos 
and Fuchs [3] proved that the mean value ta, 2 {'^) of asymptotic basis 

of order 2 cannot converge too rapidly to cx for any c > 0. A famous con- 
jecture of Erdos and Turan [2] states that the representation function of an 
asymptotic basis of order 2 must be unbounded. This problem is only a special 
case of the general inverse problem for representation functions for bases for 
the nonnegative integers: Find necessary and sufficient conditions for a function 
/ : No -> No satisfying card (/“^(O)) < oo to be the representation function of 
an asymptotic basis of order 2 for No. 

It is a remarkable recent discovery that the inverse problem for represen- 
tation functions for the integers, and, more generally, for arbitrary countably 
infinite abelian groups and countably infinite abelian semigroups with a group 
component, is significantly easier than the inverse problem for representation 
functions for the nonnegative integers and for other countably infinite abelian 
semigroups (Nathanson [5]). 
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3 Construction of thick bases for the integers 

Let [x] denote the integer part of the real number x. 

Lemma 1 Let f : Z No U {oo} be a function such that /“^(O) is finite. 
Let A denote the cardinality of the set /~^(0). Then there exists a sequence 
U = of integers such that, for every n G Z and k €N, 

/(n) = card{/c > 1 : Uk = n} 



and 



\uk\ < 



A:4- A 
2 



Proof. Every positive integer m can be written uniquely in the form 

m = s^+s-l-l + r, 

where s is a nonnegative integer and |r| < s. We construct the sequence 



y-{0, -1,0, 1,-2, -1,0, 1,2, -3, -2, 



-1,0, 1,2,3,...} 



where 

= r for |r| < s. 

For every nonnegative integer k, the first occurrence of —k in this sequence is 
= —k, and the first occurrence of k in this sequence is = k. 

The sequence U will be the unique subsequence of V constructed as follows. 
Let n G Z. If /(n) = oo, then U will contain the terms for every 

s > \n\. If /(n) = i < oo, then U will contain the £ terms Vs^^s+i+n foi" 
s = |n|, |n| + 1 , . . . , |n| +^ — 1 in the subsequence U, but not the terms Vs 2 j^s-\-i+n 
for s > \n\ Let mi < m2 < m3 < • • • be the strictly increasing sequence 
of positive integers such that is the resulting subsequence of V . Let 

U = where Uk - Then 

f{n) = card{A: >l\Uk=n}. 

Let card (/~^(0)) = A. The sequence U also has the following property: If 
\uk\ = n, then for every integer m ^ /~^(0) with |m| < n there is a positive 
integer j < k with uj = m. It follows that 

{0, 1, -1,2, -2, . . . ,n - 1, -(n - 1)} \ /-^(O) C {«i,U 2 , . . .,Uk-i}, 



and so 



This implies that 



A: - 1 > 2(n - 1) + 1 - A. 



|itfc| = n < 



k + A 



2 
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Since Uk is an integer, we have 



|wfc| < 



'A; + A 
2 



This completes the proof. □ 



Lemma 1 is best possible in the sense that for every nonnegative integer A 
there is a function / : Z — ^ Nq U {oo} with card (/“^(O)) = A and a sequence 
U — of integers such that 



\Uk\ = 



k + A 
2 



for all A: > 1. 



( 3 ) 



For example, if A = + 1 is odd, define the function / by 



f{n) 



r 0 if |n| < (5 
\ 1 if jn| > (5 + 1 



and the sequence U by 



U2i—i = 5 

U2i — + 0 



for all z > 1. 

If A = is even, define / by 



f(n) = 



0 if —S < n < S — 1 

1 if n > 5 or n < —S — 1 



and the sequence 1/ hy ui = S and 



U2i = s -h i, 

'^2i+l = —(S + i) 

for all z > 1. In both cases the sequence U satisfies (3). 
Theorem 1 Let / : Z ^ Nq U {oo} he any function such that 

A = card(/“^(0)) < oo. 



Let 



c — 8 -}- 



A + r 

2 



There exist uncountahly many sets A of integers such that 
f'A, 2 {^) = /(^) for all n e Z 



A{-x,x) > 



1/3 



and 
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Proof. Let 



A = card(/ ^(0)). 



By Lemma 1, there exists a sequence U = of integers such that 



f{n) = card({z G N : = n}) for all integers n (4) 

and 

k -h A 

\uk\ < — - — for all k > 1. (5) 

We shall construct a strictly increasing sequence of positive integers 

and an increasing sequence of finite sets of integers such that, for all 

positive integers k, 

(i) 

\Ak\ = 2k, 

(ii) There exists a positive number c such that 

Ak C [—ck^^ck^] 



(iii) 



rAk, 2 (n) < f(n) 



(iv) For j = 1, . . . , k, 



for all n e z, 



rAkfi{uj) > card({i < ik : Ui = uj}). 



Let be a sequence of finite sets satisfying (i)-(iv). We form the 

infinite set 

OO 

A=\jAk. 

k=l 

Let X > 8c, and let k be the unique positive integer such that 

ck^ < X < c{k -f 1)^. 

Conditions (i) and (ii) imply that 



Since 



A{-x,x) > \Ak\ = 2k > 2 ^ - 2 > 



/(n) = lim card({z <ik-Ui = n}), 

fc— >oo 



, 1/3 



conditions (iii) and (iv) imply that 



TAain) - lim rAk, 2 {n) = /(n) 

«— >oo 



for all n G Z. 
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We construct the sequence as follows. Let z'l == 1. The set Ai will 

be of the form Ai = {a\ + zzii,— ai}, where the integer a\ is chosen so that 
2Ai D /“^(O) = 0 and ai -f ^ —ai. This is equivalent to requiring that 

2ai ^ (/-*(0) - 2ui,) U (-r'(O)) U (6) 

This condition excludes at most 1 -h 2 A integers, and so we have at least two 
choices for the number a\ such that \ai\ < 1 + A and ai satisfies (6). Since 
l^ii I = 1^1 1 < (1 + ^)/2 and 



1^1 + I < \(^i I + I < 



3(1 -h A) 
2 



it follows that Ai C [— c, c] for any c > 3(1 + A)/2, and the set Ai satisfies 
conditions (i)-(iv). 

Let k > 2 and suppose that we have constructed sets Ai, . . . , Ak-i and 
integers i\ < • ■ < ik-i that satisfy conditions (i)-(iv). Let ik > ik-i be the 
least integer such that 

TAk-^,2{Ui^) < /(Uu). 

Since 



it follows that 
Also, (5) implies that 



ne{ui } 

tiG Z 

^2k - 1 > 

2 

< 2^, 

ik < 2k^ 
ik A ^ 



I'^ik I — 






2 - 2 
We want to choose an integer ak such that the set 



(7) 



Ak — Ak—i U {fl/j H“ Uif ^ , U'fc } 
satisfies (i)-(iv). We have |Aa;| = 2A: if 



and 



or, equivalently, if 



dk "1“ ^ CLk 

Ak -1 n {dk Aui^,-dk} = 0, 



^k ^ U (^Ak—i '^ik) LI { 'U2^/2}. 



( 8 ) 
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Thus, in order for Ak-i U {ak —dk) to satisfy condition (i), we exclude at 

most 2\Ak-i \ + I = 4:k — Z integers as possible choices for ak. 

The set Ak will satisfy conditions (iii) and (iv) if 

2Aifen/-i(0) -0 



and 



for all n G 2Ak-i \ {ui^} 
rAk-iA'^) + 1 forn 

1 for all n G 2Ak \ {2Ak-i U {ui^}) . 

Since the sumset 2Ak decomposes into 

2Ak — 2 (^Ak—i U {or/j + Tiffc ) 

== 2Ak-i U {Ak-i -h {ak + Ui,^, —dk}) U 2a^ -h 2ui^, -2ak}, 
it suffices that 



{Ak-i + {dk -f Ui ^ , —dk}) n 2Ak-i — 0, (9) 

{Ak-i 4- {dk 4- Wife, —dk}) n /“^(O) = 0, (10) 

{Ak-i + dk Ui^) n {Ak-i — dk) = 0, (11) 

{2ak+2ui,,-2ak}n2Ak-i -0 ( 12 ) 

{2ak 4- 2ui,,-2ak] H f~\0) = 0 (13) 

{2ak 4- 2ui^,-2ak] H {Ak-i 4- {dk 4- Uij^,-ak}) = 0. (14) 



Equation (9) implies that the integer ak must be chosen so that it cannot be 
represented either in the form 



dk = Xi X2 - Xs - Ui 



or 

dk X\ X 2 X"^ , 

where a:i,X 2 ,X 3 G Ak-\. Since card (A^_i) = 2(/: — l), it follows that the number 
of integers that cannot be chosen as the integer ak because of equation (9) is at 
most 2(2(A; — 1))^ = 16(A: — 1)^. 

Similarly, the numbers of integers excluded as possible choices for ak because 
of equations (10), (11), (12), (13), and (14) are at most 4A(/c — 1), 4(A: — 1)^, 8(A: — 
1)^,2A, and S{k — 1), respectively, and so the number of integers that cannot 
be chosen as ak is 

16{k-l)^ -f 12(A: - 1)2 4 - (4A 4- 8)(A: - 1) + 2A 
= 16A;^ - Sdk^ 4- (32 4- 4A)A: - 2A - 12 
< (16 4- A)k^ - 4^2 _ s2k{k - 1) - 2A - 12. 
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Let 



c — 8 + 



‘A + r 



The number of integers a with 
"A + 1 



\a\ < ck^ — — 



= 8 + 



A + 1 



k^-e- 



A + 1‘ 



(15) 



IS 



^16 + 2 - 2 k '^ -2 



A + 1 



+ 1 



> (16 + A)k^~ 2k‘^ - A. 

If the integer a satisfies (15), then (7) implies that 



It follows that there are at least two acceptable choices of the integer ak such 
that the set Ak = Ak~i U {ak + satisfies conditions (i)-(iv). Since 

this is true at each step of the induction, there are uncountably many sequences 
that satisfy conditions (i)-(iv). This completes the proof. □ 

We can modify the proof of Theorem 1 to obtain the analogous result for 
the restricted representation function rA, 2 {n). 

Theorem 2 Let / :Z->NoU{oo} be any function such that 

card(/~^(0)) < oo. 

Then there exist uncountably many sets A of integers such that 
^> 1 , 2 (^) = /(^) all n G Z 



and 

A{—x,x) ^ 



4 Representation functions for bases of order h 

We can also prove similar results for the representation functions of asymptotic 
bases and restricted asymptotic bases of order h for all /i > 2. 

For any set A C Z, the representation function rA,h{'^) counts the number 

of ways to write n in the form n = ai + U 2 + h , where ai , U 2 , . . • , ^ A 

and ai < U 2 < • • • < a/i- The set A is called an asymptotic basis of order h if all 
but finitely many integers can be represented as the sum of h not necessarily 
distinct elements of A, or, equivalently, if the function 



'f' A,h • Z — No U {oo} 
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satisfies 

card(r;^’^^(0)) < oo. 

Similarly, the restricted representation function rA,h{'^) counts the number of 
ways to write n as a sum of h pairwise distinct elements of A. The set A is 
called a restricted asymptotic basis of order h if all but finitely many integers 
can be represented as the sum of h pairwise distinct elements of A. 

Theorem 3 Let f : X Nq U { 00 } be any function such that 

card(/“^(0)) < 00 . 

There exist uncountably many sets A of integers such that 
'f'A.hi'n) = f{n) for all n ^ Z 

and 

A{-x,x) > 

and there exist uncountably many sets A of integers such that 
'^A,h{'n) = f{n) for all n £ Z 

and 

A{—x,x) ^ 
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Abstract 

A Sidon set is a set A of integers such that no integer has two es- 
sentially distinct representations as the sum of two elements of A. More 
generally, for every positive integer g, a ^ 2 [^]-set is a set A of integers 
such that no integer has more than g essentially distinct representation- 
s cis the sum of two elements of A. It is proved that almost all small 
subsets of {1, 2, . . . ,n} are ^ 2 [^]-sets, in the sense that if B 2 [g](kjn) de- 
notes the number of B 2 [^]-sets of cardinality k contained in the interval 

{1,2,... ,n}, then lim„_Kx) ^ 2 [g](k, n)/(l) = 1 if fc = o . 

1 Sidon sets 

Let A be a nonempty set of positive integers. The sumset 2 A is the set of all 
integers of the form a\ + a2, where ai,a2 G A. The set A is called a Sidon set 
if every element of 2 A has a unique representation as the sum of two elements 
of A, that is, if 

and 

Oi -h (22 = (2i -f O25 

and if 

oi < 02 and a\ < 02, 

then 

oi = a'l and 02 = 0,2- 
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More generally, for positive integers h and g, the h-fold sumset hA is the set of 
all sums of h not necessarily distinct elements of A. The representation function 
i'A,h{m) counts the number of representations of m in the form 



m — a\ -h <22 H" * • ' + tt/i , 



where 

ai e A for alH = 1, 2, . . . , h, 

and 

^1 ^ <22 ^ ^ < 2 / i - 

The set A is called a B/,[g]-set if every element of hA has at most g representa- 
tions as the sum of h elements of A, that is, if 



rA,h{m) < g 



for every integer m. In particular, a jB 2 [l]-set is a Sidon set, and Bft[l]-sets are 
usually denoted Bh-sets. 

Let h > 2. Let A be a nonempty set of integers, and a e A. Then r/j(m -I- 
a) > rh-i{m). Therefore, if rA,h-i(m) > g for some m £ (h - \)A, then 
rA,h{m + a) > g for every a £ A. It follows that if yl is a 5/i[g]-set, then A is 
also a Bft_i[^]-set. In particular, every Bh- set is also a Bh-\-set. 

Let A be a subset of {l,2,...,n}, and let |A| denote the cardinality of A. 
Then hA C {h,h -f- If |yl| = k, then there are exactly 

ordered /i- tuples of the form (ci, . . . , a*), where Uj 6 A for alH = 1, . . . , h and 
oi < • • • < a/,. If A is a B/j[ 5 ]-set and |A| = k, then 



k^ f k A h — 1\ ^ ^ 

■^ < ( ^ ’rA,h{m) < g\hA\ < ghn, 

^ ' m€hA 



and so 

|>1| = A: < 

for c = {Mghf^'^ . It follows that if is a “large” subset of {1,2,..., n}, then 
A cannot be a B/,[g]-set. In this paper we prove that almost all ’’small” subsets 
of ( 1 , 2 ,..., n} are B 2 [ 5 ]-sets and almost all ’’small” subsets of {1,2,..., n} are 
Bft-sets. 

Notation. If {u„}^i and {u„}^i are sequences and > 0 for all n, 
we write Un - o{vn) if lim„_^oo 'Wn/i’n = 0, and = 0(i;„) or if 

I'^nl < cVn for some c > 0 and all n > 1. The number c in this inequality is 
Ccdled the implied constant. 



2 Random small 52[^]-sets 

We require the following elementary lemma. 
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Lemma 1 If n > 1 and 0 < j < k < n, then 



Proof. We have 




since 

for i 0, 1, . . . ,n — 1. □ 




{n- j)\k\ _ k - i ^ fkV 
{k — j)\n\ ji — { — j 

k — i ^k 
n — i ~ n 



Theorem 1 For any positive integers g^k, and n, let B 2 [g]{k,n) denote the 
number of B 2 [g]-sets A contained in {1, . . . ,n} with \A\ = k. Then 



B2[g]{k,n) > Q 



^ _ Ak‘^3+2 



Proof Let A be a subset of {1,2, of cardinality k. If A is not a 

^ 2 b]-set, then there is an integer m < 2n such that r^, 2 (^) > 9, that is, m 
has at least 5 ^ + 1 representations as the sum of two elements of A. This means 
that the set A contains ^ -f 1 integers ai , . . . , such that 

I ^ <2l <C * ' • <C Up_|_i < — , 

and A also contains the g-\-l integers m — Oi ior i = I, . . . , ^ + I. If < m/2, 
then 

\{ai,m- ai : i = I, . . . , g + l}\ = 2g - 2. 

If = ^/2, then 



\{ai,m- ai:i = l,...,g + l]\ = 2g-l. 

Therefore, for each integer m, the number of sets A C {l,...,n} such that 
\A\ — k and r^, 2 (^) > ^ + I is at most 

(i^]\ (n-2g-2\ /[n^]\ fn-2g- l\ 

\g + ljyk-2g-2j^\ g jyk-2g-lj’ 

and so 



- B2[g]{k,n) < 




n — 2g — 2\ 
k-2g-2j 



-f- 



S„CT)( 



m<2n 



n - 2c/ - l\ 
k-2g -l) 
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Observing that 



m<2n ^ ^ ' m<2n 

and 

^ ('“f')<2n«« 

m<2n \ ^ / 

and applying Lemma 1, we obtain 

- I /n\ + /n\ 

m<2n Y VA;/ VA;/ 



a) 



/jL\2s^+2 2p+l 

<2„.«(-) +2„.«(-) 

4 ^ 29+2 



m—1 ' 



2 



-) 

nj 



2g+l 



n3 

This completes the proof. □ 



Theorem 2 Let he a sequence of positive integers such that < n 

for all n and 

kn = o . 

Then 

^2[g](fcn.n) _ J 



(£) 

Proof. This follows immediately from Theorem 1. □ 



Theorem 3 Let B 2 {k,n) denote the number of Sidon sets of cardinality k con- 
tained in n). If kn = o (n^/^) , then 



lim 

n—¥oo 



B2ikn,n) 

LZ) 



= 1 . 



Proof. This follows immediately from Theorem 2 with g = 1. \3 



Theorem 4 Let {/cn}^i 
for all n and 



he a sequence of positive integers such that kn < n 
kn = o . 




267 



T.k<k„B2[g\{k,n) ^ 

™ E.<.„ il) ~ 

Proof. It suffices to show that 

Sxt. ((t) -Bi[s](k,n)) 

Ex». O 

where f{k) is defined in the proof of Theorem 1. If ai, . . . , a^, 6i, . . . , are 
positive real numbers and B = max(&i then 

bi + ■■■ + be ~ B ~ be 



This implies that 

Efc<fc„ (*) 



4k2g+2 



~ nS 



fl9/ ( 2 ^+ 3 ) 



= o{n), 



and the proof is complete. □ 



We can restate our results in the language of probability. Let fl be the 
probability space consisting of the (^) subsets of n} of cardinality k, 

where the probability of choosing A 6 fi is l/(”). If Ph,g{k,n) denotes the 
probability that a random set A 6 is a S/i[ 5 ]-set, then Theorem 2 states that 

lim P 2 n{kn,n) = 1 

n— >oo 

if =o(n»/(2s+2)^ 

Similarly, Theorem 4 states that if fc„ = o and if Ph,g{kn,n) 

denotes the probability that a random set A C {1, . . . , n} of cardinality \ A\ < kn 
is a .B/,[ 5 ]-set, then 

lim P 2 ,g{kn,n) = 1. 

n — >^oo 



3 Random small Bh-sets 

A set A is called a Bh-set if rA,him) = 1 for all m e hA. Let Bh{k,n) denote 
the number of Bh-sets of cardinality k contained in {1, . . . , n}. Since every set 
of integers is a Bi-set, and every B^ set is a B*_i-set, it follows that 




Bi{k,n) > ■■■ > Bh.-i(k,n) > Bh{k,n) > ■•■ 
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We shall prove that almost all “small” subsets of {1, ... ,n} are Bh-sets. The 
method is similar to that used to prove Theorem 2, but, for /i > 3, we have to 
consider the possible dependence between different representations of an integer 
n as the sum of h elements of A. This means the following: Let (ai, . . . , a^,) and 
(a'l, . . . , aj^) be h-tuples of elements of A such that 

CLi H" • • • + a/i = a'l + • • • -h ajj , 



ai < • • • < tt/i, 

and 

{ui , . . . , dh } LI , . . . , U/j^} ^ 0. 

If h = 2, then = a- for z = 1, 2, but if h > 3, then it is not necessarily 
true that = a’ ■ for alH = 1, . . . , h. For example, in the case h = 3 we have 
l + 3-f4 = 1 + 2 + 5 but (1,3,4) ^ (1,2,5). In the case h — 5 we have 
l + l + 2 + 3 + 3 = 1 + 2 + 2 + 2 + 3 but (1, 1, 2, 3, 3) ^ (1, 2, 2, 2, 3) , even though 
{1,1,2,3,3} = {1,2, 2,2,3}. 

Because of the lack of independence, we need a careful description of a rep- 
resentation of m as the sum of h not necessarily distinct integers. We introduce 
the following notation. Let A be a set of positive integers. Corresponding to 
each representation of m in the form 



m = ai H h a/i, (1) 

where ai, . . . G A and < • • • < u/i, there is a unique triple 

(r,(h,).(a')), (2) 

where 

(i) r is the number of distinct summands in this representation, 

(ii) {hj) = (/ii, . . . , h^,) is an ordered partition of h into r positive parts, that 
is, an r-tuple of positive integers such that 

h = h\ + • • • + 

(hi) (a') == (a'l, . . . ,a^) is an r-tuple of pairwise distinct elements of A such 
that 

1 < a'l < • • • < < m 

and 

{o-l , . . . , <X/i} = {fl]^ , . . . , , 

(iv) 

m = hia'i + • • • + 

where each integer occurs exactly hj times in the representation (1). 
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There is a one-to-one correspondence between distinct representations of an 
integer m in the form (1) and triples of the form (2). Moreover, for each r and 
m, the integer is completely determined by the ordered partition {hj) of h 
and the (r — l)-tuple (a^, . . . Therefore, for positive integers m and r, 

the number of triples of the form (2) does not exceed Ti:r{h)rrf~^ , where 7r^(h) 
is number of ordered partitions of h into exactly r positive parts. 

Theorem 5 Let h >2. For all positive integers k < n, 

Bh-i{k,n) - Bh{k,n) < ^ 

and 

where the implied constants depend only on h. 

Proof. Let ^ be a contained in n}. Then hA C {h,h + 

1, . . . , hn}. If m G hA and m has two distinct representations as the sum of h 
elements of A^ then there exist positive integers ri and V 2 and triples 

and (^ 2 , (/iaj), (a' 2 j)) (3) 

such that, for z = 1 and 2, we have 

Ti 

TO = ^ hija'ij, 
n 

h = ^ ^ 

j=i 

and 

1 < < • • • < < m. 

The number of pairs of triples of the form (3) for fixed positive integers m, 
ri < /i, and V 2 <his at most 

where the implied constant depends only on h. Moreover, since A is a jB/i_i-set, 
no number can have two representations as the sum of h — 1 elements of A. This 
implies that 

{tti 1 , <^1^2? • • * ’ ^l,ri } {^2,15 ^2,25 • • • > ^2,T2} ~ 

and so the set 

{a' ^ : z = 1,2 and j 1,. . . ,rj 
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contains exactly ri +r2 elements of A. Therefore, given positive integers ri < h, 
T2 < h and m < hn, there are 






n — ri —r2 

k - ri - T 2 



sets A for which m has two representations as the sum of h elements of A, and in 
which one representation uses ri distinct integers and the other representation 
uses T2 distinct integers. Summing over m < hn, we obtain 






/n - n - T2\ 
\k-ri -r2/‘ 



Applying Lemma 1, we obtain 



Bh-i{k,n) - Bh{k,n) 




n 



ri+r2 — l (n—ri—r2 



( n—ri —r2\ 
k—r\ —T2 / 



(^) 

/ lc \ 

^n+r2-l / « j 






^ri+r2 






r\ ,7'2'^h 

n \ k‘^^ 



n 



k I n 



It follows that 



and so 



- Bh{k,n) = Bi{k,n) - Bh{k,n) 






i=2 






/ 7 i\ k!^^ 






^n\ k‘^^ 
n 



Bh(k,n) > 



1 - 






n 



This completes the proof. □ 



Theorem 6 Let Bh{k^ n) denote the number of Bh-sets A contained m {1 , . . . , n} 
with \A\ = k. Let be a sequence of positive integers such that 



o 



l/ 2 h\ 
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Then 



lim 

L) 



Proof. By Theorem 5, 



1 \ Bh{kn^n) ^ f kn \ 

U'/'v 



2h 



il) 



and so 



y ^h{kYijTl) 

hm — = 1. 

(fcj 



This completes the proof. □ 



4 Remarks added in proof 

A variant of Theorem 3 appears in Nathanson [2, p. 37, Exercise 14]. Godbole, 
Janson, Locantore, and Rapoport [1] used probabilistic methods to obtain a 
converse of Theorem 6. They proved that if 



lim . 

n->oo ri^l 



— oo, 



then 



lim = 0. 



(fe"J 

They also analyzed the threshold behavior of Bh{k,n), and proved that if 

kn 



then 



lim ■ = A > 0, 

n— )-oo 



hm = e-\ 






where X = 

It is natural to conjecture that analogous results hold for the function Bh[g]{k, n), 
namely, if 

= 0 , 



then 



and if 



n9/(9h+h) 



lim = 1, 

— CJ 



n-H’So n9/i9h+h) 



= OO, 
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then 



lim 

n—^oo 



Bh{kn,n) 

(;.) 



= 0. 



It should also be possible to describe the threshold behavior of Bh{kn,n)j[^) 
in the case 



lim 

n— >oo 



Bh{kn,n) 

(a 



A > 0. 
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